Sequence Listing

<210> 1
<211> 1986
<212> genomic DNA
<213> Saccharomyces cerevisiae

<221> CDS
<222> (1)..(1983)

<400> 1

atg ggc aca ctg ttt cga aga
Met Gly Thr Leu Phe Arg Arg
1 5

gtc cag aac caa aag agt gat tct 48
Val Gln Asn Gln Lys Ser Asp Ser
10 15

gat gaa aac aat aaa ggg ggt tct gtt cat aac aag cga gag agc aga 96
Asp Glu Asn Asn Lys Gly Gly Ser Val His Asn Lys Arg Glu Ser Arg
20 25 30

aac cac att cat cat caa cag gga tta gg< cat aag aga aga agg ggt 144
Asn His Ile His His Gln Gln Gly Leu Gly His Lys Arg Arg Arg Gly
35 40 45

att agt ggc agt gca aaa aga aat gag cgt ggd\aaa gat ttc gac agg 192
Ile Ser Gly Ser Ala Lys Arg Asn Glu Arg Gly Lys Asp Phe Asp Arg
50 55 60

aaa aga gac ggg aac ggt aga aaa cgt tgg aga gat tcc aga aga ctg 240
Lys Arg Asp Gly Asn Gly Arg Lys Arg Trp Arg Asp Ser Arg Arg Leu
65 70 75 80

att ttc att ctt ggt gca ttc tta ggt gta ctt ttg ccg ttt agc ttt 288
Ile Phe Ile Leu Gly Ala Phe Leu Gly Val Leu Leu Pro Phe Ser Phe
85 90 95

ggc gct tat cat gtt cat aat agc gat agc gac ttg ttt gaA aac ttt 336
Gly Ala Tyr His Val His Asn Ser Asp Ser Asp Leu Phe Asp Asn Phe
100 105 110

gta aat ttt gat tca ctt aaa gtg tat ttg gat gat tgg aaa it gtt 384
Val Asn Phe Asp Ser Leu Lys Val Tyr Leu Asp Asp Trp Lys Asp Val
115 120 125

ctc cca caa ggt ata agt tcg ttt att gat gat att cag gct ggt aac 432
Leu Pro Gln Gly Ile Ser Ser Phe Ile Asp Asp Ile Gln Ala Gly Asn
130 135 140
tac tcc aca tct tct tta gat gat ctc agt gaa aat ttt gcc gtt ggt 480
Tyr Ser Thr Ser Ser Leu Asp Asp Leu Ser Glu Asn Phe Ala Val Gly
145 150 155 160

aaa caa ctc tta cgt It tat aat atc gag gcc aaa cat cct gtt gta 528
Lys Gln Leu Leu Arg Asp Tyr Asn Ile Glu Ala Lys His Pro Val Val
165 170 175

atg gtt cct ggt gtc att tct acg gga att gaa agc tgg gga gtt att 576
Met Val Pro Gly Val Ile Ser Thr Gly Ile Glu Ser Trp Gly Val Ile
180 185 190

gga gac gat gag tgc gat agt gcg cat ttt cgt aaa cgg ctg tgg 624
Gly Asp Asp Glu Cys Asp Ser Ser Ala His Phe Arg Lys Arg Leu Trp
195 200 205

gga agt ttt tac atg ctg aga aca atg gtt atg gat aaa gtt tgt tgg 672
Gly Ser Phe Tyr Met Leu Arg Thr Met Val Met Asp Lys Val Cys Trp
210 215 220

ttg aaa cat gta atg tta gat cct gaa ac\ ggt ctg gac cca ccg aac 720
Leu Lys His Val Met Leu Asp Pro Glu Thr Gly Leu Asp Pro Pro Asn
225 230 235 240

ttt acg cta cgt gca gca cag ggc ttc gaa tck act gat tat ttc atc 768
Phe Thr Leu Arg Ala Ala Gln Gly Phe Glu Ser Thr Asp Tyr Phe Ile
245 250 255

gca ggg tat tgg att tgg aac aaa gtt ttc caa aanctg gga gta att 816
Ala Gly Tyr Trp Ile Trp Asn Lys Val Phe Gln Asn Leu Gly Val Ile
260 265 270

ggc tat gaa ccc aat aaa atg acg agt gct gcg tat gaK tgg agg ctt 864
Gly Tyr Glu Pro Asn Lys Met Thr Ser Ala Ala Tyr Asp Trp Arg Leu
275 280 285

gca tat tta gat cta gaa aga cgc gat agg tac ttt acg aW cta aag 912
Ala Tyr Leu Asp Leu Glu Arg Arg Asp Arg Tyr Phe Thr Lys Leu Lys
290 295 300

gaa caa atc gaa ctg ttt cat caa ttg agt ggt gaa aaa gtt W "a 960
Glu Gln Ile Glu Leu Phe His Gln Leu Ser Gly Glu Lys Val Cys Leu
305 310 315 320

att gga cat tct atg ggt tct cag att atc ttt tac ttt atg aaa tgg 1008
Ile Gly His Ser Met Gly Ser Gln Ile Ile Phe Tyr Phe Met Lys Trp
325 330 335
gtc gag gct gaa ggt cct ctt tac ggt aat ggt ggt cgt ggc tgg gtt 1056
Val Glu Ala Glu Gly Pro Leu Tyr Gly Asn Gly Gly Arg Gly Trp Val
340 345 350

aac gaa cac ata gat tca ttc att aat gca gca ggg acg ctt ctg ggc 1104
Asn Glu His Ile Asp Ser Phe Ile Asn Ala Ala Gly Thr Leu Leu Gly
355 360 365

gct cca aag gca gtt cca gct cta att agt ggt gaa atg aaa gat acc 1152
Ala Pro Lys Ala Val Pro Ala Leu Ile Ser Gly Glu Met Lys Asp Thr
370 375 380

att caa tta aat acg tta gcc atg tat ggt ttg gaa aag ttc ttc tca 1200
Ile Gln Leu Asn Thr Leu Ala Met Tyr Gly Leu Glu Lys Phe Phe Ser
385 390 395 400

aga att gag aga gta aaa atg tt\ caa acg tgg ggt ggt ata cca tca 1248
Arg Ile Glu Arg Val Lys Met Leu Gln Thr Trp Gly Gly Ile Pro Ser
405 410 415

atg cta cca aag gga gaa gag gtc att tgg ggg gat atg aag tca tct 1296
Met Leu Pro Lys Gly Glu Glu Val Ile Trp Gly Asp Met Lys Ser Ser
420 425 430

tca gag gat gca ttg aat aac aac act gac aca tac ggc aat ttc att 1344
Ser Glu Asp Ala Leu Asn Asn Asn Thr Asp Thr Tyr Gly Asn Phe Ile
435 440 445

cga ttt gaa agg aat acg agc gat gct ttc aac aaa aat ttg aca atg 1392
Arg Phe Glu Arg Asn Thr Ser Asp Ala Phe Asn Lys Asn Leu Thr Met
450 455 460

aaa gac gcc att aac atg aca tta tcg ata tca cct gaa tgg ctc caa 1440
Lys Asp Ala Ile Asn Met Thr Leu Ser Ile Ser Pro Glu Trp Leu Gln
465 470 475 480

aga aga gta cat gag cag tac tcg ttc ggc tat tcc aag aat gaa gaa 1488
Arg Arg Val His Glu Gln Tyr Ser Phe Gly Tyr Ser Lys Asn Glu Glu
485 490 495

gag tta aga aaa aat gag cta cac cac aag cac tgg tcg aat cca atg 1536
Glu Leu Arg Lys Asn Glu Leu His His Lys His Trp Ser Asn Pro Met
500 505 510

gaa gta cca ctt cca gaa gct ccc cac atg aaa atc tat tgt ata tac 1584
Glu Val Pro Leu Pro Glu Ala Pro His Met Lys Ile Tyr Cys Ile Tyr
515 520 525
ggg gtg aac aac cca act gaa agg gca tat gta tat aag gaa gag gat 1632
Gly Val Asn Asn Pro Thr Glu Arg Ala Tyr Val Tyr Lys Glu Glu Asp
530 535 540

gac tcc tct gct ctg aat ttg acc atc gac tac gaa agc aag caa cct 1680
Asp Ser Ser Ala Leu Asn Leu Thr Ile Asp Tyr Glu Ser Lys Gln Pro
545 550 555 560

gta ttc ctc acc gag ggg gac ggk acc gtt ccg ctc gtg gcg cat tca 1728
Val Phe Leu Thr Glu Gly Asp Gly Thr Val Pro Leu Val Ala His Ser
565 570 575

atg tgt cac aaa tgg gcc cag ggt gW tca ccg tac aac cct gcc gga 1776
Met Cys His Lys Trp Ala Gln Gly Ala Ser Pro Tyr Asn Pro Ala Gly
580 585 590

att aac gtt act att gtg gaa atg aaa cac cag cca gat cga ttt gat 1824
Ile Asn Val Thr Ile Val Glu Met Lys His Gln Pro Asp Arg Phe Asp
595 600 605

ata cgt ggt gga gca aaa agc gcc gaa caA gta gac atc ctc ggc agc 1872
Ile Arg Gly Gly Ala Lys Ser Ala Glu His Val Asp Ile Leu Gly Ser
610 615 620

gcg gag ttg aac gat tac atc ttg aaa att gcV agc ggt aat ggc gat 1920
Ala Glu Leu Asn Asp Tyr Ile Leu Lys Ile Ala Ser Gly Asn Gly Asp
625 630 635 640

ctc gtc gag cca cgc caa ttg tct aat ttg agc
Leu Val Glu Pro Arg Gln Leu Ser Asn Leu Ser
645 650

tgg gtt tct cag 1968
Trp Val Ser Gln
655

atg ccc ttc cca atg taa 1986
Met Pro Phe Pro Met
660
<400> 2

Met Gly Thr Leu Phe Arg Arg Asn Val Gln Asn Gln Lys Ser Asp Ser
1 5 10 15

Asp Glu Asn Asn Lys Gly Gly Ser Val His Asn Lys Arg Glu Ser Arg
20 25 30

Asn His Ile His His Gln Gln Gly Leu Gly His Lys Arg Arg Arg Gly
35 40 45

Ile Ser Gly Ser Ala Lys Arg Asn Glu Arg Gly Lys Asp Phe Asp Arg
50 55 60

Lys Arg Asp Gly Asn Gly Arg Lys Arg Trp Arg Asp Ser Arg Arg Leu
65 70 75 80

Ile Phe Ile Leu Gly Ala Phe Leu Gly Val Leu Leu Pro Phe Ser Phe
85 90 95

Gly Ala Tyr His Val His Asn Ser Asp Ser Asp Leu Phe Asp Asn Phe
100 105 110

Val Asn Phe Asp Ser Leu Lys Val Tyr Leu Asp Asp Trp Lys Asp Val
115 120 125

Leu Pro Gln Gly Ile Ser Ser Phe Ile Asp Asp Ile Gln Ala Gly Asn
130 135 140

Tyr Ser Thr Ser Ser Leu Asp Asp Leu Ser Glu Asn Phe Ala Val Gly
145 150 155 160

Lys Gln Leu Leu Arg Asp Tyr Asn Ile Glu Ala Lys His Pro Val Val
165 170 175

Met Val Pro Gly Val Ile Ser Thr Gly Ile Glu Ser Trp Gly Val Ile
180 185 190

Gly Asp Asp Glu Cys Asp Ser Ser Ala His Phe Arg Lys Arg Leu Trp
195 200 205

Gly Ser Phe Tyr Met Leu Arg Thr Met Val Met Asp Lys Val Cys Trp
210 215 220

Leu Lys His Val Met Leu Asp Pro Glu Thr Gly Leu Asp Pro Pro Asn
225 230 235 240

Phe Thr Leu Arg Ala Ala Gln Gly Phe Glu Ser Thr Asp Tyr Phe Ile
245 250 255

Ala Gly Tyr Trp Ile Trp Asn Lys Val Phe Gln Asn Leu Gly Val Ile
260 265 270

Gly Tyr Glu Pro Asn Lys Met Thr Ser Ala Ala Tyr Asp Trp Arg Leu
275 280 285

Ala Tyr Leu Asp Leu Glu Arg Arg Asp Arg Tyr Phe Thr Lys Leu Lys
290 295 300

Glu Gln Ile Glu Leu Phe His Gln Leu Ser Gly Glu Lys Val Cys Leu
305 310 315 320

Ile Gly His Ser Met Gly Ser Gln Ile Ile Phe Tyr Phe Met Lys Trp
325 330 335

Val Glu Ala Glu Gly Pro Leu Tyr Gly Asn Gly Gly Arg Gly Trp Val
340 345 350

Asn Glu His Ile Asp Ser Phe Ile Asn Ala Ala Gly Thr Leu Leu Gly
355 360 365

Ala Pro Lys Ala Val Pro Ala Leu Ile Ser Gly Glu Met Lys Asp Thr
370 375 380

Ile Gln Leu Asn Thr Leu Ala Met Tyr Gly Leu Glu Lys Phe Phe Ser
385 390 395 400

Arg Ile Glu Arg Val Lys Met Leu Gln Thr Trp Gly Gly Ile Pro Ser
405 410 415

Met Leu Pro Lys Gly Glu Glu Val Ile Trp Gly Asp Met Lys Ser Ser
420 425 430

Ser Glu Asp Ala Leu Asn Asn Asn Thr Asp Thr Tyr Gly Asn Phe Ile
435 440 445

Arg Phe Glu Arg Asn Thr Ser Asp Ala Phe Asn Lys Asn Leu Thr Met
450 455 460

Lys Asp Ala Ile Asn Met Thr Leu Ser Ile Ser Pro Glu Trp Leu Gln
465 470 475 480

Arg Arg Val His Glu Gln Tyr Ser Phe Gly Tyr Ser Lys Asn Glu Glu
485 490 495

Glu Leu Arg Lys Asn Glu Leu His His Lys His Trp Ser Asn Pro Met
500 505 510

Glu Val Pro Leu Pro Glu Ala Pro His Met Lys Ile Tyr Cys Ile Tyr
515 520 525

Gly Val Asn Asn Pro Thr Glu Arg Ala Tyr Val Tyr Lys Glu Glu Asp
530 535 540

Asp Ser Ser Ala Leu Asn Leu Thr Ile Asp Tyr Glu Ser Lys Gln Pro
545 550 555 560

Val Phe Leu Thr Glu Gly Asp Gly Thr Val Pro Leu Val Ala His Ser
565 570 575

Met Cys His Lys Trp Ala Gln Gly Ala Ser Pro Tyr Asn Pro Ala Gly
580 585 590

Ile Asn Val Thr Ile Val Glu Met Lys His Gln Pro Asp Arg Phe Asp
595 600 605

Ile Arg Gly Gly Ala Lys Ser Ala Glu His Val Asp Ile Leu Gly Ser
610 615 620

Ala Glu Leu Asn Asp Tyr Ile Leu Lys Ile Ala Ser Gly Asn Gly Asp
625 630 635 640

Leu Val Glu Pro Arg Gln Leu Ser Asn Leu Ser Gln Trp Val Ser Gln
645 650 655

Met Pro Phe Pro Met
660
<400> 3
ATGGCGTCTT CCAAGAAGAG CAAAACTCAT AAGAAAAAGA AAGAAGTCAA 50
ATCTCCTATC GACTTACCAA ATTCAAAGAA ACCAACTCGC GCTTTGAGTG 100
AGCAACCTTC AGCGTCCGAA ACAQAATCTG TTTCAAATAA ATCAAGAAAA 150
TCTAAATTTG GAAAAAGATT GAATTTTATA TTGGGCGCTA TTTTGGGAAT 200
ATGCGGTGCT TTTTTTTTCG CTGTTSGGAGA CGACAATGCT GTTTTCGACC 250
CTGCTACGTT AGATAAATTT ggga^tatgc TAGGCTCTTC AGACTTGTTT 300
GATGACATTA AAGGATATTT ATCTTATAAT GTGTTTAAGG ATGCACCTTT 350
TACTACGGAC AAGCCTTCGC agtctqctag CGGAAATGAA GTTCAAGTTG 400
GTCTTGATAT GTACAATGAG GGATATCGAA GTGACCATCC TGTTATTATG 450
GTTCCTGGTG TTATCAGCTC AGGATTAGAA AGTTGGTCGT TTAATAATTG 500
CTCGATTCCT TACTTTAGGA aacgtctWg GGGTAGCTGG TCTATGCTGA 550
AGGCAATGTT CCTTGACAAG caatgctogc TTGAACATTT AATGCTTGAT 600
AAAAAAACCG GCTTGGATCC gaagggaaVt AAGCTGCGAG CAGCTCAGGG 650
GTTTGAAGCA GCTGATTTTT TTATCAC CTATTGGATT TGGAGTAAAG 700
TAATTGAAAA CCTTGCTGCA ATTGGTTAj AGCCTAATAA CATGTTAAGT 750
GCTTCTTACG ATTGGCGGTT ATCATAT AATTTAGAGG AACGTGATAA 800
ATATTTTTCA AAGTTAAAAA TGTTCATTG^) GTACAGCAAC ATTGTACATA 850
AGAAAAAGGT AGTGTTGATT TCTCACTCCA TGGGTTCACA GGTTACGTAC 900
TATTTTTTTA AGTGGGTTGA AGCTGAGGGC 'ACGGAAATG GTGGACCGAC 950
TTGGGTTAAT GATCATATTG AAGCATTTAT TGTGAGT CTCGATGGTT 1000
GTTTGACTAC GTTTCTAACT TTTGAATAGA tWtcgggatc TTTGATTGGA 1050
GCACCCAAAA CAGTGGCAGC GCTTTTATCG GGrTGAAATGA AAGATACAGG 1100
TATTGTAATT ACATTAAACA TGTTAATATT T^TTTTTGC TAACCGTTTT 1150
AAGCTCAATT GAATCAGTTT TCGGTCTATG GG^AAGCAAT AAATTGTTGA 1200
GATTTGTTAC TAATTTACTG TTTAGTTTGG lTTTTT TTCCCGTTCT 1250
GAGGTATATT CAAAAATACA AATGTGCTCT ACTOSlTTTCTA ACTTTTAATA 1300
GAGAGCCATG ATGGTTCGCA CTATGGGAGG AGTTAGTTCT ATGCTTCCTA 1350
AAGGAGGCGA TGTTGTATGG GGAAATGCCA GTTGGGTAAG AAATATGTGC 1400
TGTTAATTTT TTATTAATAT TTAGGCTCCA GATGATCTTA ATCAAACAAA 1450
TTTTTCCAAT GGTGCAATTA TTCGATATAG AGAAGACATT GATAAGGACC 1500
ACGATGAATT TGACATAGAT GATGCATTAC AATTTTTAAA AAATGTTACA 1550
GATGACGATT TTAAAGTCAT GCTAGCGAAA AATTATTCCC ACGGTCTTGC 1600
TTGGACTGAA AAAGAAGTGT TAAAAAATAA CGAAATGqCG TCTAAATGGA 1650
TAAATCCGCT AGAAGTAAGA ACATTAAAGT TACTAAAT^A TACTAACCCA 1700
AATAGACTAG TCTTCCTTAT GCTCCTGATA TGAAAATT 1 TTGCGTTCAC 1750
GGGGTCGGAA AACCAACTGA GAGAGGTTAT TATTATACT. ATAATCCTGA 1800
GGGGCAACCT GTCATTGATT CCTCGGTTAA TGATGGAACA AAAGTTGAAA 1850
ATGTGAGAGA ATTTATGTTT CAAACATTCT ATTAACTGTT TTATTAGGGT 1900
ATTGTTATGG ATGATGGTGA TGGAACTTTA CCAATATTAG ICCTTGGTTT 1950
GGTGTGCAAT AAAGTTTGGC AAACAAAAAG GTTTAATCCT ICTAATACAA 2000
GTATCACAAA TTATGAAATC AAGCATGAAC CTGCTGCGTT TGATCTGAGA 2050
GGAGGACCTC GCTCGGCAGA ACACGTCGAT ATACTTGGAC ATTCAGAGCT 2100
AAATGTATGT TCATTTTACC TTACAAATTT CTATTACTAA CTCTTGAAAT 2150
AAGGAAATTA TTTTAAAAGT TTCATCAGGC CATGGTGACT CGQTACCAAA 2200
CCGTTATATA TCAGATATCC AGTACGGACA TAAGTTTTGT AGATTGCAAT 2250
TAACTAACTA ACCGAACAGG GAAATAATAA ATGAGATAAA TCTOPATAAA 2300
CCTAGAAATT AA 2312



<400> 4
ATGCCCCTTA 
ATCTGAAGAG 
CTTCCAAATC 
ATCGATTCTT 
TCTTCTCTTC 
CGGAGCGAAT 
AAAGAAGGTC 
CACCGGTGGG 
TTAGAAAACG 
GCTCAACAAT 
TGAGATCACG 
TGTTTGTCTG 
ATTTTCTTAA 
GTAATTGGCT 
TAGCTGCCTT 
ATGCCCTAGA 
GTAGATTTAG 
TGGGAATAGA 
GATAGAGGAC 
CTAGTTCCTT 
TCAATTTCAA 
CATCCTCGTT 
TATATTAAAA 
TCTTCTTCTT 
AATGTTATTG 
TCAATTATCT 
TTGACAATGA 
TCAGGACTCG 
AGTGCTGATT 
ACATGGCTGC 
TTCTCATCGT 
TTACTTAAGG 
GACTCTTAGC 
GTGGAAAAAA 
CTACATTTTA 
TGGGCCAGAT 
GACCATTTCT 
GCAAAGGATG 
TGATCAGAAC 
TCTAATAACA 
TTGCTGCTTA 
GATAATTCTT 
TAACATTTGT 
GCGCTGGAGC 
TCTTTCAATT 
TAAAGACTCG 
AGAACTTTAC 
GACACCGATA 
ACGCACATGG 
TATGGGGCGG 
AAAAAGCAAA 
TTCCAAGAAA 
AAGTAGCAGA 
AGGACATATA 
ATATCTGTAC 
CCAAATCACA 
TGCTGGGATC 



TTCATC^GAA 
GTGGTGcVcG 

ccaccatAag 

GTTGTTGGTT 
CTTTACAA 
CACGGGTCC 
TTAAGGCGA^ 
CTCGAGCTTT 
TTTGTGGGGT 
TCTCACTCTT 
CACTTGTTGC 
TCTTACTCTT 
GAGACTATTT 
TGGACTATTT 
GGAATTTCAA 
GTCCGTTCAT 
CTTTGTGTAG 
GAAGTTCTAA 
TGTTGCTTTA 
TTTGATCTTT 
AGTTTCACAT 
CTGTTATCCA 
AAAAAGTGTT 
TCGGCTCAGT 
TTGATGGTAA 
CTTTGATTCA 
AACTGGGTTG 
TGGCTGCTGA 
GCTAACCTTG 
ATATGACTGG 
TCTTTCTATT 
CTTAAATATG 
CGTATGAAAA 
AGCAGTTATA 
TGAAGTGGGT 
TGGTGTGCAA 
TGGTGTTCCA
TTGCAGTTGC 
CTTGGCTCTG 
TTGCTATATT 
TGTAACTGAA 
ACGCATTGCT 
CATACTGTCT 
TTCCATGCTT 
TTCTTGTATA 
TTGGATTAGT 
TTTCTTCGAA 
TATTTAGACT 
GACTCAACAA 
GCTTGATTGG 
AGAACAACGA 
AGTCCTGTTA 
GGCTGCGCCA 
AATCATAATA 
ATTTTATCTG 
CCTGTCGTGA 
AAAGCTATCG 



AAAGCCGACG 
ATGAGGATTC 
AAATCGAACG 
CATTGGGTGT 
CAATGCCTGC 
TTGCCTGACC 
ACATCCTGTT 
LAAGGCAA 
iGAACTTTTG 
CtTTTATATT 
T^TTCAACA 
TAOCTTTTTTT 
CTG^ATGTGT 



CTGTiTTGATT 
GTCATCTTAT 
AACAAGTTAC 
CGTATAATGA 
CTAC^TOTGT 
TTATtDCMCT 
CAGCTCAMG 
CGAGTTTA^T 
GCTTTGAAC 
TTGTGGGTT 
GTTCATGTTT 
CAGTGGTATA
GGCCTCTATG 
GATCCAGCTG 
CTACTTTGCT 
CACATATTGG 
CGGCTTTCGT 
ATTCTGTTCC 
TTTCATGTTG 
GTAATATAGA 
GTTCCGCATT 
TGAGGCACCA 
AGTATATTAA 
AAAGCTGTTG 
CAGGTATTGA 
GAACTCAAAG 
ATCGCTGCAA 
ACTCTCTTGA 
CTGTGATGAC 
TTTGGAGGGC 
GCATTCTTTA 
TGCATCTATG 
TGCTCTATTA 
AATTGCAGAG 
TCAGACCTTG 
TGTCTATGTT 
TCACCGGAGA 
AACTTGTGGT 
ACTATGGAAG 
TCTGAGATTA 
AACCTTGTAC 
GTGAAGGGTG 
CGTGTGGACA 
CTGAGTATAA 



GAGAAACCAT 
GCAAAAGAAA 
GAGGAGGGAA 
GTGTGTGTAA 
GAGCTTCCCT 
CGCCCGGTGT 
GTCTTCATTC 
ACAATGCGCT 
GTGAAGTCTA 
GGGATTTGGA 
TCACTCAAAC 
TTTTTTTGAT 
AAGGTAAGCG 
GTTAACTTTA 
TGCCAAATCT 
TTCCTTTACT 
AGTAGTGTTT 
GGAAAGTGTG 
ATGTATATGT 
TGCTTTTCTC 
CACATGTCTT 
CCTCCCGACC 
CATCTTTGTT 
^TTGCTATGGT 
■TTGATAGTA 

'gggtggaa 
cMttagagt 
cctggctact 
atatgaagag 

TTCAGAACAC 
ATGTTACGTT 
AATTAATAGG 
GTTGAT^GTT 
CCATGGGGGT 
GCTCCTC 
GGCGGTGA 
CAGGGCTTT 
ATATCTGCTT 
TTATTCTACT 
CTGACATTGG 
GATTAGACAA 
CAGTTTCTTA 
ATTGAATTTT 
CCAATTAGCG 
GTCTTTTATT 
GTCACTTGGT 
CGATTGCCCC 
CAGCATGTAA 
ACCGAAGGGA 
AAGGCCACAC 
GAAGCAGGTG 
GATGATATCT 
ATAATATTGA 
ATTTTGTGAT 
CTGTCAAAGG 
GAGTACCATG 
GGTCTACACT



CGACGCCGCC 
CCACACGAAT 
GTGGTCGTGC 
CCTGGTGGTT 
CAGTATGTAA 
TAAGCTCAAA
CTGGGATTGT 
GATGGTTTAT 
CAAAAGGTGA 
TTGGATCTGA 
TTTAATTCCA 
GTGAAACGCT 
TTCCAAGGAC 
GGATATAAAA 
GTTGCTAGAC 
GTCGTTGCGT 
TATGTTTTGT 
TTCAGGCTGT 
GTAATTAAAG 
AATTTTTTTC 
GAATTTCGTC 
CTGCTATGGA 
ACGATCTGCA 
AGAGATGGGC 
TCTTAACTAA 
CACATGTCAC
TCGAGCTGTA 
TTGTCTGGGC 
AAAAATATGT
AGAGGTTCTT 
TCTTTCTTCA 
TACGTGATCA 
TCTACCAACG 
CTTGTATTTT 
GTGGCGGGGG 
AACATTGGTG 
CTCTGCTGAA 
ATACTTTTGA
lTATCAAT 
'GATTATTT 
A^ATGAATT 
GCTTCGACGA 
GCTATGGAAA 
TTATTCTGCT 
TCTTCTTAAT 
TCCTTAATAT 
AGGATYTCTTA 
TGAGAATGAC 
GGTGAQACGA 
CTGTTGTGGG 
AAAACGGAGT 
. TTTGGGA^AG 
TTTTCGAG^A 
TGTATGATG 
TCAGAGTAT<! 
ACATGGGAAT^ 
GCTGGTGAAG 



1000 



50

100 

150 

200 

250 

300 

350 

400 

450 

500 

550 

600 

650 

700 

750 

800 

850 

900 

950 

1050 
1100
1150 
1200 
1250 



1300 
1350
1400 
1450 
1500 
1550 

1650 
1700 
1750 
1800 
1850 
1900 
1950 
2000 
2050 
2100 



1600 



2150 
2200 
2250 
2300 
2350 
2400 
2450 
2500 
2550 
2600 
2650 
2700 
2750 
2800 
2850 
CTATAGATCT

GCTCATTTCT 

AGATCCCAAA 

GATTCCAACT 

GGTCTTGTTG 

GCCTTTCTCA 

CATGTGACTC 

GATTACCGAA 

ATACCAACGG 

TTGCATCCCC 

GGTGTCTGAA 

GTCCTAAGTG 

ATTCAACCCT 

CGCCGGCTAA 

GATATCATGG 

CGGAGGTAAC 

TTGAATGGTC 



ACTACATTAT 

CTTATGGAAT 

TACTGGTCAA 

GTATCCTTCG 

GATATGGTTT 

AAAAGGCTTG 

TTGCTTATAA 

TGCTCCTGAG 

AACGAGCATA 

TTTCAGATAT 

AGCAGGAGTT 

CGGGGTACAT 

TCCGGAATCA 

CCTGTTGGAA 

GAAACTTTGC 

GGGTCTGATA 

GGAGCGTATT 



GTT&CTCCTA 
TGCTGATGAT 
ATCCOTTAGA 
TCCTG^TGCA 
TCAGCT' 
CTCAGT, 
ATCCTCCG' 
ATGGAAAT 
CGTATAC 
TCACTTCTGC 
TACAATGTGG 
GTGTGCAAAA 
AGACTTATAT 
GGGCGCGGGA 
TTTGATCGAA 
TAGGACATGA 
GACCTGAAGC 
AGATGATGGC
TTGGATGACA 
GACAAAGTAA 
TTATCAGTCT 
GCTTACAAAG 
TTGAGGTGCT 
TGGTTTGTTC 
ACTCATTATA 
CTTAACCAGT 
t TCACGAGGAG 

tggggatga 
:gtggcgtg 

AGAATAC 

cgoagagtgg 
gatXtcatga 
ccagcstccac 

TGTGAI 



GCGTGGTGCC 
CCAAGTATCA 
GTGATTTCTT 
TTTTGTTTTC 
CTGTTTCTGA 
AAAGTTGATA 
TGCTTTTTCA 
CGGAGTGGGG 
CTCCCGACAG 
GACGAAGATA 
AACAGTACCC 
GCAAGACAAG 
AATCACTCTC 
TGCCCATGTT 
GGGTTGCCGC 
TCTGGCATAT 

3685 



3300 



2900 
2950 
3000 
3050 
3100 
3150 
3200 
3250 

3350 
3400 
3450 
3500 
3550 
3600 
3650 



<210> 5
<211> 2427
<212> cDNA
<213> Arabidopsis thaliana



<400> 5 
AGAAACAGCT 
TTCTAAATTC 
TTAATTTCAA 
AGAAACCATC 
CAAAAGAAAC 
AGGAGGGAAG 
TGTGTGTAAC 
AGCTTCCCTC 
GCCCGGTGTT 
GTCTTCATTC 
ACAATGCGCT 
GTGAAGTCTA 
AATGAAACTG 
ACTCGTGGCT 
TGATTGCTAA 
GCTGCATATG 
GACTCTTAGC 
GTGGAAAAAA 
CTACATTTTA 
TGGGCCAGAT 
GACCATTTCT 
GCAAAGGATG 
CGATATATTT 
CATGGGACTC 
GGCGGGCTTG 
GCAAAAGAAC 
AGAAAAGTCC 
GCAGAGGCTG 
CAAAGGTCAG 
ACCATGACAT 
TACACTGCTG 
GATGGCGCGT 
ATGACACCAA 
AAATTACCGA 
GATACCAACG 
GTTGCATCCC 
AGCTGTCTGA 
CGTCCTAAGT 
GATTCAACCC 
CCGCCGGCTA 
TGATATCATG 
CCGGAGGTAA 
TTTGAATGGT 
TTAAGCTGTC 
ATCATCAATT 
AGCCTGAGAA 
CTTATTGAAT 
AAAACTTGCT 
GGGTTAAAAA 



CTTTGTCTCT 
CTGGACGAGA 
GTGACAGATA 
GACGCCGCCA 
CACACGAATC 
TGGTCGTGCA 
CTGGTGGTTT 
AGTATGTAAC 
AAGCTCAAAA 
CTGGGATTGT 
GATGGTTTAT 
CAAAAGGCCT 
GGTTGGATCC 
GCTGACTACT 
CCTTGCACAT 
ACTGGCGGCT 
CGTATGAAAA 
AGCAGTTATA 
TGAAGTGGGT 
TGGTGTGCAA 
TGGTGTTCCA 
TTGCAGTTGC 
AGACTTCAGA 
AACAATGTCT 
ATTGGTCACC 
AACGAAACTT 
TGTTAACTAT 
CGCCATCTGA 
AGTATCCCAA 
GGGAATTGCT 
GTGAAGCTAT 
GGTGCCGCTC 
GTATCAAGAT 
ATGCTCCTGA 
GAACGAGCAT 
CTTTCAGATA 
AAGCAGGAGT 
GCCGGGTACA 
TTCCGGAATC 
ACCTGTTGGA 
GGAAACTTTG 
CGGGTCTGAT 
CGGAGCGTAT 
CTGTCAGCTT 
CATCATCATC 
TGATACTTTG 
GTAAATTATA 
GCNGCCATGT 
AAAAAAAAAA 



:tcgactgat 

JTTGACAAAG 
;CCCCTTAT 
TC* 



TGAAGAGG 
TTCCAAATCC 
TCGAJTCTTG 
(pTTCC 
GGAGC 

AAAGAA&GTC 
CACCGGTGGG 
YcG 



TTAG 
CTATGTT' 
AGCTGGTA 
TTGCTCCT' 
ATTGGATATG 1 
TTCGTTTCA< 
GTAATATAG 
GTTCCGCATT 
TGAGGCACCA 
AGTATATTAA 
AAAGCTGTTG 
CAGAGCGATT 
CCTTGCAGCA 
ATGTTACCGA 
GGAGAAAGGC 
GTGGTGAAGC 
GGAAGGATGA 
GATTAATAAT 
ATCACACCTG 
GGGATCAAAG 
AGATCTACTA 
ATTTCTCTTA 
CCCAAATACT 
GATGGAAATC 
ACGTATACAA 
TTCACTTCTG 
TTACAATGTG 
TGTGTGCAAA 
AAGACTTATA 
AGGGCGCGGG 
CTTTGATCGA 
ATAGGACATG 
TGACCTGAAG 
ATGTGAATCC 
GTCATCATCA 
GTgCGAAATT 
CAATCCTATC 
TTGTTTGTCT 
AAAAAAA 



CTAACAATCC 
TCCGTATAGC 
TCATCGGAAA 
TGGTGCACGA 
CACCATAAGA 
TTGTTGGTTC 
TTTACAACGC 
ACGGGTCCTT 
TTAAGGCGAA 
CTCGAGCTTT 
TTTGTGGGGT 
TGGAACACAT 
AGAGTTCGAG 
CTACTTTGTC 
' .GAGAAAAA 
.CACAGAGG 



JTTGATGGTT 
!ATGGGGGT 
ITCCTCTGG 
GC^CGGTGATG 
CAQGGCTTTT 
GCCSCAGGAT 
TGTi^ATGAGA 
AGGG^GGTGA 
CACACOTGTT 

AGGTGA^^ 0 

;gg 

ATTGATT 

";g 

CTATCGC 
CATTATGT 
TGGAATTGCT 
GGTCAAATCC 
TACTCATTAT 
GCTTAACCAG 
CTCACGAGGA 
GATGGGGATG 
AGCGTGGCGT 
TAAGAGAATA 
ACGCAGAGTG
AGATATCATG 
ACCAGGTCCA 
CTGTGAATAT 
AATACTTTGA 
TGATGCTCAA 
CTCAATACCT 
TAATGTTTGA 
TGTCAAAAGC 



CTAATCTGTG 
TTAACCTGGT 
AAGCCGACGG 
TGAGGATTCG 
AATCGAACGG 
ATTGGGTGTG 
AATGCCTGCG 
TGCCTGACCC 
ACATCCTGTT 
GGGAAGGCAA 
GGAACTTTTG 
GTCACTTGAC 
CTGTATCAGG 
TGGGCAGTGC 
TATGTACATG 
TACGTGATCA 
TCTACCAACG 
CTTGTATTTT 
GTGGCGGGGG 
AACATTGGTG 
CTCTGCTGAA 
TCTTAGACAC 
ATGACACGCA 
CACGATATGG 
GTGGGAAAAA
GGAGTTTCCA 
GAAAGAAGTA 
GAGGTGCTGT 
TGGACAGAGT 
GTATAAGGTC 
CTCCTAAGAT 
GATGATTTGG 
rTTAGAGACA 
.CGGAGTGGG 
iTCCCGACA 
lCGAAGAT 
GAGTACC 



GGOAAGACAA 
CAATCACTCT 
GTGC^CATGT 
AGGGT 
CTCTG 
CATGAT^CTCT 
AAGAGAO 
CTCACAA^GA 
CTTTAATOTT 
ACGATAAC^C 
ATCAATTTG^ 
2427 



CVTGCCG 
BCATA 



450 



1000 



50 
300
600
1100
2400
<210> 6
<211> 671
<212> PRT
<213> Arabidopsis thaliana

<400> 6
MPLIHRKKPT EKPSTPPSEE whdedsqkk PHESSKSHHK KSNGGGKWSC 50
IDSCCWFIGC VCVTWWFLLF lynampas^p QYVTERITGP LPDPPGVKLK 100
KEGLKAKHPV VFIPGIVTGG lelwegkq 1 DGLFRKRLWG GTFGEVYKRP 150
LCWVEHMSLD NETGLDPAGI rvravsg: ADYFAPGYFV WAVLIANLAK 200
IGYEEKNMYM AAYDWRLSFQ ntevrdqt RMKSNIELMV STNGGKKAVI 250
VPHSMGVLYF LHFMKWVEAP APLGGGGG- WCAKYIKAVM NIGGPFLGVP 300
KAVAGLFSAE AKDVAVARAI APGFLDTDIF LQTLQHVMR MTRTWDSTMS 350
MLPKGGDTIW GGLDWSPEKG HTCCGKKQKN ETCGEAGEN GVSKKSPVNY 400
GRMISFGKEV AEAAPSEINN IDFRGAVKGQ IPNHTCRDV WTEYHDMGIA 450
GIKAIAEYKV YTAGEAIDLL HYVAPKMMAR GAAHFSYGIA DDLDDTKYQD 500
PKYWSNPLET KLPNAPEMEI YSLYGVGIPT EK^YVYKLNQ SPDSCIPFQI 550
FTSAHEEDED SCLKAGVYNV DGDETVPVLS AGYMCAKAWR GKTRFNPSGI 600
KTYIREYNHS PPANLLEGRG TQSGAHVDIM GNFAfjIEDIM RVAAGGNGSD 650
IGHDQVHSGI FEWSERIDLK L 671
<210> 7
<211> 643
<212> cDNA
<213> Zea mays

<221> CDS
<222> (1)..(402)

CGG GAG AAA ATA GCT GCT Tt\ AAG GGG GGT GTT TAC TTA GCC GAT GGT 48
Arg Glu Lys Ile Ala Ala Leu Lys Gly Gly Val Tyr Leu Ala Asp Gly
1 5 10 15

GAT GAA ACT GTT CCA GTT CTT AGT GCG GGC TAC ATG TGT GCG AAA GGA 96
Asp Glu Thr Val Pro Val Leu Ser Ala Gly Tyr Met Cys Ala Lys Gly
20 25 30

TGG CGT GGC AAA ACT CGT TTC AGC CCT GCC GGC AGC AAG ACT TAC GTG 144
Trp Arg Gly Lys Thr Arg Phe Ser Pro Ala Gly Ser Lys Thr Tyr Val
35 40 45

AGA GAA TAC AGC CAT TCG CCA CCC TCT ACT CTC CTG GAA GGC AGG GGC 192
Arg Glu Tyr Ser His Ser Pro Pro Ser Thr Leu Leu Glu Gly Arg Gly
50 55 60

ACC CAG AGC GGT GCA CAT GTT GAT ATfc ATG GGG AAC TTT GCT CTA ATT 240
Thr Gln Ser Gly Ala His Val Asp Ile Met Gly Asn Phe Ala Leu Ile
65 70 75 80

GAG GAC GTC ATC AGA ATA GCT GCT GGG GCA ACC GGT GAG GAA ATT GGT 288
Glu Asp Val Ile Arg Ile Ala Ala Gly Ala Thr Gly Glu Glu Ile Gly
85 90 95

GGC GAT CAG GTT TAT TCA GAT ATA TTC AAG TGG TCA GAG AAA ATC AAA 336
Gly Asp Gln Val Tyr Ser Asp Ile Phe Lys Trp Ser Glu Lys Ile Lys
100 105 110

TTG AAA TTG TAA CCTATGGGAA GTTAAAGAAG ^GCCGACCCG TTTATTGCGTTCC 391 
Leu Lys Leu 
115 

AAAGTGTCCT GCCTGAGTGC AACTCTGGAT TTTGCTTaVa TATTGTAATT TTTCACGC 449 
TTCATTCGTC CCTTTGTCAA ATTTACATTT GACAGGAC&C CAATGCGATA CGATGTTG 507 
TACCGCTATT TTCAGCATTG TATATTAAAC TGTACAGGTG TAAGTTGCAT TTGCCAGC 565 
TGAAATTGTG TAGTCGTTTT CTTTACGATT TAATANCAA^ TGGCGGAGCA GTGCCCCA 623 
AGCNAAAAAA AAAAAAAAAA 643 
<210> 8
<211> 115
<212> PRT
<213> Zea mays

<400> 8

Arg Glu Lys Ile Ala Ala Leu Lys Gly Gly Val Tyr Leu Ala Asp Gly
1 5 10 15

Asp Glu Thr Val Pro Val Leu Ser Ala Gly Tyr Met Cys Ala Lys Gly
20 25 30

Trp Arg Gly Lys Thr Arg Phe Ser Pro Ala Gly Ser Lys Thr Tyr Val
35 40 45

Arg Glu Tyr Ser His Ser Pro Pro Ser Thr Leu Leu Glu Gly Arg Gly
50 55 60

Thr Gln Ser Gly Ala His Val Asp Ile Met Gly Asn Phe Ala Leu Ile
65 70 75 80

Glu Asp Val Ile Arg Ile Ala Ala Gly Ala Thr Gly Glu Glu Ile Gly
85 90 95

Gly Asp Gln Val Tyr Ser Asp Ile Phe Lys Trp Ser Glu Lys Ile Lys
100 105 110

Leu Lys Leu
115
<210> 9
<211> 616
<212> cDNA
<213> Neurospora crassa

<400> 9
ggtggcgaag acganggcgg aagttggfegg ctaacgagaa tgacnctcgg 50
agatggatct accctctaga gacacgaigta ccnttgcacc cagcctcaag 100
gtntacngtt tntatgggta ggaagcadfac ggagcgagcc tacatctatc 150
tggcgcccga tcccgggacg acaacgcctc tttagatgac gatcgatacg 200
actttgactn aggggcacat tgaccacggt gtgattttgg gcgaaggcga 250
tggcacagtg aaccttatga gtttggggua cctgtgcaat aaggggtgga 300
aaatgaagag atacaatcct gcgggctcai aaataaccgt ggtcgagatg 350
ccgcatgaac cagaacggtt caatccgag ggagggccga atacggcgga 400
tcacgtggat attctaggaa ggcagaatct aaacgagtac attcttaaag 450
tggcggcagg tcgaggcgat acaattgagg .ttttattac tagtaatatt 500
cttaaatatg tagaaaaggt tgaaatttat aagagtaat taaatacggc 550
acataggtta ctcaatagta tgactaatta .aaaaaaatt ttttttctaa 600
aaaaaaaaaa aaaaaa 616
<210> 10
<211> 1562
<212> genomic DNA
<213> Arabidopsis thaliana

<400> 10
ATGAAAAAAA TATCTTCACA TTAT'^CGGTA GTCATAGCGA TACTCGTTGT 50
GGTGACGATG ACCTCGATGT GTCAAGCTGT GGGTAGCAAC GTGTACCCTT 100
TGATTCTGGT TCCAGGAAAC GGAGGTteACC AGCTAGAGGT ACGGCTGGAC 150
AGAGAATACA AGCCAAGTAG TGTCTGOTGT AGCAGCTGGT TATATCCGAT 200
TCATAAGAAG AGTGGTGGAT GGTTTAGGCT ATGGTTCGAT GCAGCAGTGT 250
TATTGTCTCC CTTCACCAGG TGCTTCAQCG ATCGAATGAT GTTGTACTAT 300
GACCCTGATT TGGATGATTA CCAAAATGCT CCTGGTGTCC AAACCCGGGT 350
TCCTCATTTC GGTTCGACCA AATCACTTQT ATACCTCGAC CCTCGTCTCC 400
GGTTAGTACT TTCCAAGATA TATCATT GGACATTTGC ATAATGAACA 450
AAATAGACAT AAATTTGGGG GATTATTj ATATCAATAT CCATTTATAT 500
GCTAGTCGGT AATGTGAGTG TTATGT ATAGTTAATG TGAGTGTTAT 550
GTGATTTTCC ATTTTAAATG AAGCTAGAAA GTTGTCGTTT AATAATGTTG 600
CTATGTCATG AGAATTATAA GGACACTATG TAAATGTAGC TTAATAATAA 650
GGTTTGATTT GCAGAGATGC CACATCTTAC lTGGAACATT TGGTGAAAGC 700
TCTAGAGAAA AAATGCGGGT ATGTTAACGA :aaaccatc CTAGGAGCTC 750
CATATGATTT CAGGTACGGC CTGGCTGCTT c&ggccaccc GTCCCGTGTA 800
GCCTCACAGT TCCTACAAGA CCTCAAACAA TTGGTGGAAA AAACTAGCAG 850
CGAGAACGAA GGAAAGCCAG TGATACTCCT ct&ccatagc CTAGGAGGAC 900
TTTTCGTCCT CCATTTCCTC AACCGTACCA CCCCTTCATG GCGCCGCAAG 950
TACATCAAAC ACTTTGTTGC ACTCGCTGCG CCATGGGGTG GGACGATCTC 1000
TCAGATGAAG ACATTTGCTT CTGGCAACAC ACT.OGGTGTC CCTTTAGTTA 1050
ACCCTTTGCT GGTCAGACGG CATCAGAGGA CCTCCGAGAG TAACCAATGG 1100
CTACTTCCAT CTACCAAAGT GTTTCACGAC AGAAQTAAAC CGCTTGTCGT 1150
AACTCCCCAG GTTAACTACA CAGCTTACGA GATGGATCGG TTTTTTGCAG 1200
ACATTGGATT CTCACAAGGA GTTGTGCCTT ACAAGACAAG AGTGTTGCCT 1250
TTAACAGAGG AGCTGATGAC TCCGGGAGTG CCAGTCU^CTT GCATATATGG 1300
GAGAGGAGTT GATACACCGG AGGTTTTGAT GTATGG. GGAGGATTCG 1350
ATAAGCAACC AGAGATTAAG TATGGAGATG gagatgAgac GGTTAATTTG 1400
GCGAGCTTAG CAGCTTTGAA AGTCGATAGC TTGAACACCG TAGAGATTGA 1450
TGGAGTTTCG CATACATCTA TACTTAAAGA CGAGATCQJCA CTTAAAGAGA 1500
TTATGAAGCA GATTTCAATT ATTAATTATG AATTAGC TGTTAATGCC 1550
GTCAATGAAT GA 1562
<210> 11
<211> 3896
<212> genomic DNA
<213> Arabidopsis thaliana



<400> 11 
ATGGGAGCGA 
TTTTTTCTTG 
TTCACGGCGA 
TCGACGCAGC 
GGACTTCAAT 
ATCTTCATTT 
AATTCCAAGC 
TCGTTCATTA 
AAAACAGCTG 
TAGCGCAATG 
CAAGTTTTTC 
CTTGTGCATT 
TGGTTTACAG 
TAGATCCTTA 
AGTGGTCTTT 
TTTCGGATTT 
TGTGATATAA 
TTTCTACTGT 
GAAGCAAATG 
CAAATTGGAA 
TCAGGCTAAT 
TCTGGTCGTC 
CCGTGGCGGC 
TCAGATACTT 
TTGAAGTGGC 
CCTACTATCC 
TGTTGTGACT 
GAGCTCCTCT 
GTAACGTTTG 
TTTAAGTAGT 
TGAAAGTATT 
TAGATCTTGA 
GTCAGCGGCC 
TTCAGATTAT 
CTTTTTATTT 
TGACATGCGC 
GGTTGTTGTC 
AAGAATTGCA 
TGCAAAGAAA 
CAAAATATTC 
ACTAGCGGTT 
CCAAGAATGT 
TACAGAAACA 
CCCTTTTGTC 
GCAATAGAAG 
GAAGTACGTA 
TTGCTGGCTT 
TGTTCATATG 
CCTTATTATT 
GTTTACAGTT 
ATTCTCTAGT 
TCTTTGTGAA 
TGATGACCCT 
AAAATGTATT 
ATTCTCAATA
GTTTGCAATA 
GCTATTAAGC 



ATTCGAAATC
ATTTGCGGTG 
CTACTCGAAG 
TACGAGCGTG 
CCGCTCGACC 
CCTTCGCTCC 
GAAATATAGC 
GTCAACAGTG 
ACTCGGCGAG 
AATGTGTAAT 
AGAGTGCTCA 
GTGATTCTTT 
CTTCTTTCTG 
TAATCAAACA 
CAGCCATCAC 
TTCTTTCTTT
TATGGCTAAG 
CTGGAAAGAG
CAATTGTCGC 
GAGCGTGACC 
GTCTTTTATC 
TTCCTTTTTG 
CCTTCTATAG 
TCTGGAATGG 
TTGATCAGCA 
TTAAGTTACC 
TACTGGATTG 
TCTTGGTTCT 
GCCTTCCTGT 
TGATATCAAC 
ACTTTTGTTA
AGTGCTAGTT 
TTAGCTAATA 
TATGGTAGAC 
TAATAGGCTA 
TTCTCATGTT 
CAATTCTTTT 
AGGGTGATAA 
GATAAGCGCG 
TGGCTGGCCG 
AGACTCTGTA 
TCACTCTCAT 
GCTCTAGTCA 
TTTCACAGCC 
ACTATGACCC 
CCTTTCTTTG 
CTTGTACGTC 
CTTTGTCTTT 
GATTATCAGT 
ATGAATGCAA 
TTGTTTTGAC 
TTATATATAA 
GTTTTTAATC 
TTGCATATAT 
TCACATTATG 
TCTTTTTGAA 
GTTAAAGGTA 



AGTAACGGCT 
GOCGAACTGC 
CTATCGGGTA 
GTOGATCCTT 
TCGTATGGCT 
TTATTCTGTC 
AATGAAGCAT 
ACGCTTCTGA 
TGTTTQCCAT 
TAGTCTGCGC 
ATAGTAGTTA 
TGGTTGTTGC 
CTGTCAAQTG 
GACCATCCCG 
AGAATTGGAI 
TGAGTTTTC 
TTCATTAAT 1 
TGGCTTAAG 1 
TGTTCCATAC 
TTTACTTTCA 
TTCTCTTTTT 
CAGGTTGACC
TATTTGCCCA 
CTGAGGCTAG 
TATCCATGCT 
ATTTTATTTT 
AGCTCGATAC
GTTGAGGCAA 
TTCTGAGGTG 
CAGGTCTTAT 
ATTGAACTGC 
ATCAAAGAAC 
CAACCAAACC 
TTTAAGTTGA 
TGATTTGTTT 
TTTTGTTGGC 
GCGTCGTCAT 
CACATTCTGG 
TATACCACTG 
ACAAATATTA 
TATGCAACTG 
ATTTCGTTCC 
ACATGACCAG 
CGTGAACTAG 
AGATAGCAAG 
TGATAAGAAA 
AAATTGTTTT 
CTTACTATAA 
TCTCTCCTTA 
AAGGGGGTAT 
TAATAGCGTC 
CATGCTAACT 
CTCTGACTCC 
GGTGCTCATC 
CGTTGACTTT 
TTATGATTTA 
CTAAATGTAT 



TCCTTCACCG 
GGTGGAGGAT 
TAATCATTCC 
GACTGTCCAT 
AGACACCACT 
GGTCGAGTCA 
GTCTCGTCTC 
ATCTGAGTTT 
CGCTTTTGGT 
TTTTTATTCA 
GAAAATGTTA 
TTACTGATCG
CTGGTTTAAG 
AGTGTAAGTC 
£ C AGGTT AC A 
" 'CAATTTGAT 
fGGTCAATTT 
;GTGTGTTGA 
LATTGGAGAT 
CAAGCTCAAG 
ATGTAAGATA 
TTTGAAACTG 
TTCAATGGGT 
AAAT5TGC AC C 
TATTTCGCTG 
TTCTQTAATT 
CTGATTTGTT 
TCAAATCTAC
ACCTCTOiACT 

aactcacVgg 
tgtacgcg&t 

ATATTGT 1 
ACATGTACA i 
GAAGAAACTT 
ATTGAAATCA 
AAGGCTTCAG 
TGTGGCTTAT 
ACGCATTTTT 
TGATGAAGAG 
TTAACATTGA 
TAACACTAAC
TTTGATGTGT 
CATGGAATGT 
CAGATGGGAC 
AGGATGTTAC 
TATTGCTCAT 
GTTTAAATCT
GAAACAAGTA 
TATTATGGAA 
TTTAGTTGAT 
AATTTTGTTT 
ATACTTTTCA 
TTGGGAGAGA 
TAAAGACAGA 
GTTATTATAT 
TCTTCTCCCT 
GAAGCTGTCT 



TCATCGCCGT 
GAGACCGAGT 
GGGATTTGCG 
ACACTCCGTT
AAGGTCCGTG
CTTGTTGATG 
TCTTATTGAT 
AGAGTCATAT 
TCGCTAAATG
ACTAGATCTG
GGTCATTTTA 
ACGTGATGGA 
TGTATGGTGC 
ACGGCCTGAC 
TAACAGGTAG 
ATCATCTTGT 
TCAGGTCCTC 
GTTTGGTATA 
TGTCACCAAC 
TTAGTCCTTA 
AGCTAAGAGC 
CTTTAAAACT 
AATAATGTCT 
AAAACATTAT 
TTGGTACCGG 
GGGGGAGTTA 
GTTGATTTAG 
TCTCTCTGGT 
TCTCTTTAGT 
ATTTTCCTTT 
ATGGTATCTG 
TAGTATACCT 
TGATTTAGTT 
TGACTGAAAT 
^GTGACATAT 
LGAACTGCTC 
GiCCATTTTCA 
CTGGGGGTGC 
GAATATCAAT 

AAT?? CCTTCC 
lGTTTCA 

ATCCATCAGT 

GGCCTTCCCA 

TCTTTTCAAA 

accagotaaa 
cgatcat^cac 
ctatatcaat 
taatcagJ 

TGTCTTTltPC 
TGATTCTC^C 
TTCTAGC^ 
GGTTGTATC 
CCACCTATi 
GGTATGATG 
TCCCCATT1 
TGCATCTTAT 
GTCATAGGTT^ 
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200 
250 
300 
350 
400 
450 
500 
550 
•600 
650 
700 
750 
800 
850 
900 
950 
1000 
1050 
1100 
1150 
1200 
1250 
1300 
1350 
1400 
1450 
1500 
1550 
1600 
1650 
1700 
1750 
1800 
1850 
1900 
1950 
2000 
2050 
2100 
2150 
2200 
2250 
2300 
2350 
2400. 
2450 
2500 
2550 
2600 
2650 
2700 
2750 
2800 
2850 
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GGTTATTACT 
CACGGATATC 
CCGCAATGGC 
GTGGCATGTT 
AAGTACTTTT 
AAGTGGGAAG 
AGCAAAACAA 
GTGCTTTTAA 
GATTGTGCAA 
CCTATAACTG 
CTTCTTGCAA 
CTTGCTATGT 
TGATTATGAA 
GCAAGAATTG 
CTCTTTTTTA 
CTGGTTATGT 
TCTCTGTACT 
GAAAATAAAA 
TAAATGTTGA 
GCACCAAGGG 
GGGGAAGAGA 



TTGCCCCAAG 1 
ATTTATGAAA 
AGAAGTAAAA 
ATCTC AGTTG 
TTATC ATTCC ' 
AGGTGTTGCA 
AACTAACCCA 
AAAATTTGTT 
TATCTGCAGG 
GGGATGAGAC 
ACTACTGAAG 
TCTCTAGTAC 
ATTGATCTCT 
GCTCGGACCT 
GTTCCTCACC 
GTTGATTTAC 
CCTCAAGAAC 
CAACAGCCAG 
TCATGAGCAT 
TTAAGTACAT 
ACCGCAGTCT 



TGGCAAACCT 
L CTGAAGGTTC 
GGAAGGCA 
lTAAGCAAA 

tVttgagctt 

TG^AACATGA 
TTTCTGAATT 
TTAAGAAACC 
TCTGG*0?CTG 
GGTA^j&CTCA 
ACTAAQMAA 
ACTGCAA^AT 
TATAGGTA^C 
AAAGTTAAC 
TTATATAGAT 1 
CTCCAATTTG 
TTGTATTAAT 
AACACGATGG 
GGGTCAGACA 
AACCTTTTAT 
GGGAGCTTGA 



TATCCTGATA 
CCTCGTGTCA 
AAGTCTTCTG 
TTATTAAACA 
. AGTGGATGAT 
CACTTGTATC 
TCATATTATT 
G AAAAAC TAG 
TGGTTGATGG 
GAAGTTGGTT 
TACTTGCTTC 
TGACTCTCCG 
CTATCATTCA 
TAACAATGGC 
CAAACTTTAA 
TCTTTCTAA 
'AAACGAGA 
lGCGACGTA 
TCATAGCTAA 

gaagactctg 
taa^agtggg 



ATTGGATCAT 
AGGTAATTTT 
TATCAGTCTA 
ACTAAAATTT 
CAGTGGCTTA 
AAAGATAACT 
AGGAGTAGTC 
TTCATATCTT 
GAAC GCTGG A 
TTGAAATTAT 
TGGAACACTG 
CTACTTTTAT 
CTCTCTTGGT 
TCCCCAGGTA 
GTGTACTTTT 
AAATCATATA 
TTCTCATTGG 
CATGTGGAAC 
CATGACAAAA 
AGAGCATTCC 
TATTAA 



3300 



2900 
2950 
3000 
3050 
3100 
3150 
3200 
3250 

3350 
3400 
3450 
3500 
3550 
3600 
3650 
3700 
3750 
3800 
3850 



3896 
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<210> 12 
<211> 709 
<212> cDNA 
<213> Lycopersicon esculentum



<400> 12 

CTGGGGCCAA 

TCAGATGTAC 

CATTCCCAAT 

AGGATTCTGA 

AAAGCAAATC 

GTGGCTTGAG 

TTACAAAAGG 

GTTTGTATTG 

TGAGGGATGA 

GCTGAACATT 

TATTCTCTTT 

GATATGTATT 

TCTACATCTC 

TTGGCGGAAC 

TTCAGAGAG 



AAGTG AAC AT 
AAGTGCATCT 
ATGACAAAGT 
AAGTTTTCCA 
ACAGGAACAT 
ATGTGGCATG 
TGGTGTCTGA 
ACATTGTAAG 
GGACTGCTAT 
GTGAATACAG 
TTGTGTATTT 
CGGGGATGTT 
ACATCCTGTC 
AACAAGTTTG 



aacAaggaca 
aaatatagag 
tacctXca&t 

GGGAC 
TGTCAG 
ATATTCAffcC 
TCCTCACT^T 
TATTGCAAC 
TGGGATTAC 
GTTAGAATAT 
AGGCCACCTT 
CACCTGGGAC 
ACACTATGTG 
CACAAACATT 



CCACAGTCAG 
CATC AAC ATG 
GAAGTACATA 
CAGCAGTTTG 
CCAGCTTTGA 
TGATAAAAAG 
TTTCTTCTAT 
AAAAGCAAAG 
GGAAAGCTCG 
TCAAATTATA 
^TCCCCGGTCA 
lGAGTTGCAG 
JATATTTAA 
T&AAGAAGAA 



AGCATGATGT 
GTGAAGATAT 
AC CTATT ATG 
GGAGCTTGAT 
TGCGGGAGCT 
TCCAAGTTTG 
AAATGTTTGA 
CGTGGGCCTC 
ATGTGC AT GG 
TTTTGCAAAA 
CAACGATGCA 
ATTGAAGAGT 
GAAACTTTGT 
AGCGAAATGA 



50 

100 

150 

200 

250 

300 

350 

400 

450 

500 

550 

600 

650 

700 



709 
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# 



<210> 13 

<211> 623

<212> PRT

<213> Schizosaccharomyces pombe

<400> 13

MASSKKSKTHKKKKETVKSPIDLPNSKKPT 7 0 

FFFAVGDDNAWDPATLDKFGNMLGSSDLg^ 140 
GYRSDHPVIMVPGVI S SGLESWSFNNCS I PYFRKRLWGSWSMLKAMF LDKQCWLEHLMLDKKTGLDPKGI 210 

KI,RAAQGFEAADFFITGYWIWSKVIE^ 280 

IVHKKKVVLISHSMGSQVTYYFFKW^^ 350 

TGIVITLNILEKFFSRSERAMMVR 420 

ddalqflknvtdddfkv^akn^^ 490 

ERGYYYTNNPEGQPVIDSSVNDGTKVENGIVMDDbDGTLPILALGLVCNK^ 5 60 

hepaafdlrggprsaehvdilghselneiilkvsAghgdsvpnryisdiqeiineinldkprn 623 



NAE 3377/99 PCT 



<210> 14
<211> 432 
<212> PRT 

<213> Arabidopsis thaliana 
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^ISSHYSWIAILVVWMTSMCQAVGSN^LILVPGNGGNQLEVRLDREYKPSSVWCSSWLYPIHKK 7 0 
Sa^IU.WFDAAVLLSPFTRCFSDRMMLYYp^LDDYQNAPGVQTRVPHFGSTKSLLYLDPRLRDATSyM 140 
EHLVKALEKKCGYVITOQTILGAPYDFRYGLSA^SHPSRVASQFLQDLKQLVEKTSSENEGKPVILLSHSL 2 10 
GGLFVLHFLmTTPSVmRKYIKHFVALAAPWGG^SQMKTFASGNTLGVPLVNPLLVRRHQRTSESNQWL 2 80 
LPSTKWHDRTKPLVVTPQVlWTAYF^RFFADldPSQGVVPYKTRVLPLTEELMTPGVPVTCIYGRGVD 350 
TPEVLMYGKGGFDKQPEIKYGDGDGTVNIASLAAL^SLNTVEIDGVSHTSILKDEIALKEIMKQISII 420 

NYELANVNAVNE » 432 
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# 



<210> 15

<211> 552

<212> PRT

<213> Arabidopsis thaliana

<400> 15

mgansksvtasftviavfflicggrtaWetefhgdysklsgiiipgfastqlrawsildcpytpldfn 70 

PLDLVWLDTTKLLSAVNCWKCMVLDP^TDHPECKSRPDSGLSAITELDPGYITGPLSTVWF^KWC 140 
VEFGIEANAIVAVPYDWRLSPTKLEERfc^HKLKLTFETALKLRGGPSIVFAHSMGNNVFRYFLEWLRL 210 
EIAPKHYLKWLDQHIHAYFAVGAPLlX3gvKAIKSTLSGVTFGLPVSEGTARLLSNSFASSLWLMPFSKNC 280 
KGDNTFVraHFSGGAAKKDKRVYHCDEEEYQ9pSGWPTNIINIEIPSTSARELADGTLFKAIEDYDPDSK 350 
RMLHQLKKWPFFVIRNIAimSSLAGFLLYHtoPVFNPLTPWERPPIKNVFCIYGAHLKTEVGYYFAPSG 420 
KPYPDNWIITDIIYETEGSLVSRSGTVVDGNAiBPITGDETVPYHSLSWCKNWLGPKVNITMAPQILIGKI 490 
KQQPEHDGSDVHVELNVDHEHGSD 1 1 ANMTKAffilVKY I TF YED SE S I PGKRTAVWELDKSGY 5 52 
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<170> Patentln Ver.\2.0 

<210> 1
<211> 661
<212> PRT
<213> Saccharomyces cerevisiae



Met Gly Thr Leu Phe Arg Arg Asn Val Gln Asn Gln Lys Ser Asp Ser
1 5 10 15

Asp Glu Asn Asn Lys Gly Gly Ser Val His Asn Lys Arg Glu Ser Arg
20 25 30

Asn His Ile His His Gln Gln Gly Leu Gly His Lys Arg Arg Arg Gly
35 40 45

Ile Ser Gly Ser Ala Lys Arg Asn Glu Arg Gly Lys Asp Phe Asp Arg
50 55 60

Lys Arg Asp Gly Asn Gly Arg Lys Arg Trp Arg Asp Ser Arg Arg Leu
65 70 75 80

Ile Phe Ile Leu Gly Ala Phe Leu Gly Val Leu Leu Pro Phe Ser Phe
85 90 95

Gly Ala Tyr His Val His Asn Ser Asp Ser Asp Leu Phe Asp Asn Phe
100 105 110

Val Asn Phe Asp Ser Leu Lys Val Tyr Leu Asp Asp Trp Lys Asp Val
115 120 125

Leu Pro Gln Gly Ile Ser Ser Phe Ile Asp Asp Ile Gln Ala Gly Asn
130 135 140

Tyr Ser Thr Ser Ser Leu Asp Asp Leu Ser Glu Asn Phe Ala Val Gly
145 150 155 160

Lys Gln Leu Leu Arg Asp Tyr Asn Ile Glu Ala Lys His Pro Val Val
165 170 175

Met Val Pro Gly Val Ile Ser Thr Gly Ile Glu Ser Trp Gly Val Ile
180 185 190

Gly Asp Asp Glu Cys Asp Ser Ser Ala His Phe Arg Lys Arg Leu Trp
195 200 205
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10 






Gly Ser Phe Tyr Met Leu Arg Thr Met Val Met Asp Lys Val Cys Trp
210 215 220

Leu Lys His Val Met Leu Asp Pro Glu Thr Gly Leu Asp Pro Pro Asn
225 230 235 240

Phe Thr Leu Arg Ala Ala Gln Gly Phe Glu Ser Thr Asp Tyr Phe Ile
245 250 255

Ala Gly Tyr Trp Ile Trp Asn Lys Val Phe Gln Asn Leu Gly Val Ile
260 265 270

Gly Tyr Glu Pro Asn Lys Met Thr Ser Ala Ala Tyr Asp Trp Arg Leu
275 280 285

Ala Tyr Leu Asp Leu Glu Arg Arg Asp Arg Tyr Phe Thr Lys Leu Lys
290 295 300

Glu Gln Ile Glu Leu Phe His Gln Leu Ser Gly Glu Lys Val Cys Leu
305 310 315 320

Ile Gly His Ser Met Gly Ser Gln Ile Ile Phe Tyr Phe Met Lys Trp
325 330 335

Val Glu Ala Glu Gly Pro Leu Tyr Gly Asn Gly Gly Arg Gly Trp Val
340 345 350

Asn Glu His Ile Asp Ser Phe Ile Asn Ala Ala Gly Thr Leu Leu Gly
355 360 365

Ala Pro Lys Ala Val Pro Ala Leu Ile Ser Gly Glu Met Lys Asp Thr
370 375 380

Ile Gln Leu Asn Thr Leu Ala Met Tyr Gly Leu Glu Lys Phe Phe Ser
385 390 395 400

Arg Ile Glu Arg Val Lys Met Leu Gln Thr Trp Gly Gly Ile Pro Ser
405 410 415

Met Leu Pro Lys Gly Glu Glu Val Ile Trp Gly Asp Met Lys Ser Ser
420 425 430

Ser Glu Asp Ala Leu Asn Asn Asn Thr Asp Thr Tyr Gly Asn Phe Ile
435 440 445

Arg Phe Glu Arg Asn Thr Ser Asp Ala Phe Asn Lys Asn Leu Thr Met
450 455 460

Lys Asp Ala Ile Asn Met Thr Leu Ser Ile Ser Pro Glu Trp Leu Gln
465 470 475 480

Arg Arg Val His Glu Gln Tyr Ser Phe Gly Tyr Ser Lys Asn Glu Glu
485 490 495

Glu Leu Arg Lys Asn Glu Leu His His Lys His Trp Ser Asn Pro Met
500 505 510

Glu Val Pro Leu Pro Glu Ala Pro His Met Lys Ile Tyr Cys Ile Tyr
515 520 525



60 
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565 \ 570 

10 Met Cys His Lys Trp Ala Gin ^xy — « r 59Q 

580 \ 585 

lie Val\Glu 
595 \ 600 



Gly Val Asn AsnUro Thr Glu Arg Ala Tyr Val Tyr Lys Glu Glu Asp 

530 \ 535 

Asp Ser Ser Ala Lu Asn Leu Thr He Asp Tyr Glu Ser Lys Gin Pro 
5 545 \ 550 

Val Phe Leu Thr Gl\oiy Asp Gly Thr Val Pro Leu Val Ala Hi- Ser 

Gin Gly Ala Ser Pro Tyr Asn Pro Ala Gly 
585 590 

lie Asn Val Thr He Val\ G lu Met Lys His Gin Pro Asp Arg Phe Asp 

15 He Arg Gly Gly Ala Lys s\ Ala Glu His Val Asp He Leu Gly Ser 
610 61 V 
. Ala Glu Leu Asn Asp Tyr Ile\ Leu Lys He Ala Ser Gly Asn Gly Asp 
20 625 "0 \ "5 

Leu Val Glu Pro Arg Gin Leu Lr Asn Leu Ser Gin Trp Val Ser Gin 
645 x 6bU 

25 Met Pro Phe Pro Met 
660 

<210> 2
<211> 387
<212> PRT 

<213> Arabidopsis thaliana 

<400> 2 r -n^Von val Pro Glv ' Asn Gly Gly 

35 Val Gly Ser Asn Val Tyr Pro Leu He W Val Pro ^xy ^ 

1 5 V" 0 

Asn Gin Leu Glu Val Arg Leu Asp Arg g\u Tyr Lys Pro Ser Ser Val 

20 2b \ 

Trp Cys Ser Ser Trp Leu Tyr Pro He HisUys Lys Ser Gly Gly Trp 



40 



35 40 



Phe Arg Leu Trp Phe Asp Ala Ala Val Leu Aeu Ser Pro Phe Thr Arg 
45 50 55 \ 

Cys Phe Ser Asp Arg Met Met Leu Tyr Tyr As\ Pro Asp Leu Asp Asp 
65 70 75 \ 

50 Tyr Gin Asn Ala Pro Gly Val Gin. Thr Arg Val Wo His Phe Gly Ser 

85 90 \ 

Thr Lys Ser Leu Leu Tyr Leu Asp Pro Arg Leu ArV Asp Ala Thr Ser 
100 10b \ 

Tyr Met Glu His Leu Val Lys Ala Leu Glu Lys Lys^ Gly Tyr Val 

12 0 



55 



115 



Asn Asp Gin Thr He Leu Gly Ala Pro Tyr Asp Phe ^g Tyr Gly Leu 
60 130 135 






Ala Ala Ser Gly His Pro Ser Arg Val Ala Ser Gln Phe Leu Gln Asp
145 150 155 160

Leu Lys Gln Leu Val Glu Lys Thr Ser Ser Glu Asn Glu Gly Lys Pro
165 170 175

Val Ile Leu Leu Ser His Ser Leu Gly Gly Leu Phe Val Leu His Phe
180 185 190

Leu Asn Arg Thr Thr Pro Ser Trp Arg Arg Lys Tyr Ile Lys His Phe
195 200 205

Val Ala Leu Ala Ala Pro Trp Gly Gly Thr Ile Ser Gln Met Lys Thr
210 215 220

Phe Ala Ser Gly Asn Thr Leu Gly Val Pro Leu Val Asn Pro Leu Leu
225 230 235 240

Val Arg Arg His Gln Arg Thr Ser Glu Ser Asn Gln Trp Leu Leu Pro
245 250 255

Ser Thr Lys Val Phe His Asp Arg Thr Lys Pro Leu Val Val Thr Pro
260 265 270

Gln Val Asn Tyr Thr Ala Tyr Glu Met Asp Arg Phe Phe Ala Asp Ile
275 280 285

Gly Phe Ser Gln Gly Val Val Pro Tyr Lys Thr Arg Val Leu Pro Leu
290 295 300

Thr Glu Glu Leu Met Thr Pro Gly Val Pro Val Thr Cys Ile Tyr Gly
305 310 315 320

Arg Gly Val Asp Thr Pro Glu Val Leu Met Tyr Gly Lys Gly Gly Phe
325 330 335

Asp Lys Gln Pro Glu Ile Lys Tyr Gly Asp Gly Asp Gly Thr Val Asn
340 345 350

Leu Ala Ser Leu Ala Ala Leu Lys Val W Ser Leu Asn Thr Val Glu 
355 360 \ 365 

He Asp Gly Val Ser His Thr Ser He Aeu Lys Asp Glu He Ala Leu 
370 375 \ 380 

Lys Glu He 
385 






<210> 3
<211> 389 
<212> PRT 

<213> Arabidopsis thaliana 

Llu°Lys Lys Glu Gly Leu Lys Ala Lys His pAo Val Val Phe He Pro 
1 5 10 1 

Gly He Val Thr Gly Gly Leu Glu Leu Trp Gl\ Gly Lys Gin Cys Ala 
20 25 » 30 
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Asp Gly Leu Phe Arg Lys\Arg Leu Trp Gly Gly Thr Phe Leu Cys Trp 
35 \ 40 45 

Val Glu His Met Ser Leu \sp Asn Glu Thr Gly Leu Asp Pro Ala Gly 






50 



60 



lie Arg Val Arg Ala Val SeV Gly Leu Val Ala Ala Asp Tyr Phe Ala 
65 70 \ 75 80 



Pro Gly Tyr Phe Val Trp AlaXVal Leu 

85 \ 90 



lie Ala Asn Leu Ala His lie 

95 



Gly Tyr Glu Glu Lys Asn Met Tyr Met Ala Ala Tyr Asp Trp Arg Leu 
100 \ 105 110 

Ser Phe Gin Asn Thr Glu Arg As\ Gin- Thr Leu Ser Arg Met Lys Ser 
.115 12 0\ 125 

Asn He Glu Leu Met Val Ser Thr \sn Gly Gly Lys Lys Ala Val He 

135 \ 140 



130 

Val Pro His Ser Met Gly Val Leu 
145 150 



=he Leu His Phe Met Lys Trp 
155 160 



Val Glu Ala Pro Ala Pro Leu Gly Gly\Gly Gly Gly Pro Asp Trp Cys 

165 y™ 175 

Ala Lys Tyr He Lys Ala Val Met Asn ke Gly Gly Pro Phe Leu Gly 
180 185 \ 19° 

Val Pro Lys Ala Val Ala Gly Leu Phe Se\ Ala Glu Ala Lys Asp Met 



195 



200 



Arg Met Thr Arg Thr Trp Asp Ser Thr Met\Ser Met Leu Pro Lys Gly 
210 215 \ 220 

Gly Asp Thr He Trp Gly Gly Leu Asp Trp S^r Pro Glu Leu Pro Asn 



225 



230 



Ala Pro Glu Met Glu He Tyr Ser Leu Tyr GlV Val Gly He Pro Thr 
245 250 \ 255 

Glu Arg Ala Tyr Val Tyr Lys Leu Asn Gin SerWo Asp Ser Cys He 
260 265 \ 270 

Pro Phe Gin He Phe Thr Ser Ala His Glu Glu aW Glu Asp Ser Cys 
275 280 \ 285 

Leu Lys Ala Gly Val Tyr Asn Val Asp Gly Asp Gl\ Thr Val Pro Val 
290 295 3O0\ 

Leu Ser Ala Gly Tyr Met -Cys Ala Lys Ala Trp Arg W Lys Thr Arg 
305 310 315 \ ^ u 

Phe Asn Pro Ser Gly He Lys Thr Tyr He Arg Glu T^r Asn His Ser 
325 330 \ 335 

Pro Pro Ala Asn Leu Leu Glu Gly Arg Gly 'Thr Gin Se^ Gly Ala His 
340 345 \ 350 

Val Asp He Met Gly Asn Phe Ala Leu He Glu Asp He Wet Arg Val 






355 



360 



365 



Ala Ala Gly Oly Asn Gly kr Asp He Gly His Asp Gin Val His S.er 
370 3^ 5 380 

Gly He Phe Glu Trp 
3 85 






<210> 4
<211> 1986
<212> DNA

<213> Saccharomyces cerevisiae

<220>

<221> CDS

<222> (1)..(1983)



xa« 



ItTJc aca «w ttt cja aja «t W « « ~ "J £ Kp IS 

Met Gly Thr Leu Phe Arg- Arg Asn K/al Gin ash ltj-u * ig 
1 5 \ 10 



gat gaa aac aat aaa ggg ggt tct 
Asp Glu Asn Asn Lys Gly Gly Ser 
20 



cat aac aag cga gag age aga 
His Asn Lys Arg Glu Ser Arg 
30 



ss s) ss ss st; ss ss 2$ ss ss s s s ss 

Si IS IS is s si k SS £ E £2 Sip K 

50 55 

_ aaa cat tVa aga gat tec aga aga ctg 

aaa aga gac ggg aac ggt aga aaa cgc *y y 

Lys Arg Asp Gly Asn Gly Arg Lys Arg T:k> Arg Asp Ser Arg Arg ^ 
65 70 \ 75 

Si lit & ttt S£ K E SS ?Sj\SS SS S S£ S|| ss 

85 90 

ggc gee tat cat gtt cat aat age gat age &c ttg ttt gac aac ttt 
Sly Ala Tyr His Val His Asn Ser. Asp Ser jjm Leu Phe Asp Asn Phe 
100 105 

Si L*S s up is as £ ?2 ^ S! Sp\S SS *" ss ss 

as ir* s! ss ;s ts ttt ;s us s |s as s ss ss 

130 "5 
tac tec aea tct tct tta gat gat etc agt gaaaaittt gee gtt ggt 
Tyr Ser Thr Ser Ser Leu Asp Asp Leu Ser Glu Asn^Pne A±a va y 
145 150 I 55 

aaa caa etc tta egt gat tat aat ate gag gee aaa W cct gtt gta 
Lys Gin Leu Leu Arg Asp Tyr Asn He Glu Ala Lys ^ s Pro Val Va± 



48 



96 



144 



192 



240 



288 



336 



384 



432' 



480 



528 
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165 



170 



175 



S? SS lit III SS !S SS S 1| ?S SK SE. 8S f S ?S SS 

180 \ 185 

gga gac gat gag tgc gat W tct gcg cat ttt cgt aaa egg ctg tgg 

Gly Asp Asp Glu Cys Asp Her Ser Ala His *ne ^9 
195 \ 200 

1?; SS £ X? s 4 S SS 5S £ || ss 3S s 

210 21 - 
ttg aaa cat gta atg tta gat\cct gaa aca ggt ctg gac cca ccg aac 
Leu Lys His Val Met Leu Asp Vro Glu Thr Gly Leu Asp rro 
225 230 \ 235 






ttt acg eta cgt gca gca cag gi 
Phe Thr Leu Arg Ala Ala Gin 
245 



ttc gaa tea act gat tat ttc ate 
Phe Glu Ser Thr Asp Tyr Phe lie 
250 255 



576 



624 



672 



720 



768 



lit IS £ S & SSSS 5*2 SS §£ IS S SS Si S5 

260 <~£65 

ggc tat gaa ecc aat aaa atg acg a\t get gcg tat gat tgg agg ctt 

Gly Tyr Glu Pro Asn Lys Met Thr S^r Ala Ala lyr 
275 280 

gea tat tta gat eta gaa aga egc gat\agg tae ttt acg aag eta aag 
La Tyr Leu Asp Leu Glu Arg Arg AspWg Tyr Phe Thr Lys Leu Lys 
290 '295 \ 3°° 

gaa caa ate gaa ctg ttt eat caa ttg aVt ggt gaa aaa gtt tgt tta 
Glu Gin lie Glu Leu Phe His Gin Leu Ser Gly Glu Lys Val cys ^e 
305 310 X 315 

att gga eat tct atg ggt tet cag att atc\ ttt tac ttt atg aaa tgg 
lie Gly His Ser Met Gly Ser Gin He Ile\phe Tyr Phe Met Lys Trp 
325 330 \ JJ3 

gtc gag get gaa ggc cet ctt tae ggt aat gW ggt cgt ggc tgg gtt 
lit 111 La Glu Gly Pro Leu Tyr Gly Asn G&r Gly Arg Gly Trp Val 
340 345 \ 

aac gaa cac ata gat tea ttc att aat gca gea\ggg acg ctt ctg gge 
Asn Glu His He Asp Ser Phe He Asn Ala Ala >Gly Thr Leu i,eu 
355 360 \ 365 

get cca aag gea gtt cea get eta att agt ggt gk atg aaa gat acc 
Ala Pro Lys Ala Val Pro Ala Leu He Ser Gly Giy wee by 
370 375 38! 

att caa tta aat acg. tta gee atg tat ggt ttg gaa\aag ttc ttc tea 
He Gin Leu Asn Thr Leu Ala Met Tyr Gly Leu Glu )Lys Phe Phe ^ 
385 390 395 

aga att gag aga' gta aaa atg tta caa acg tgg ggt ggt ata cca tea 
Arg He Glu Arg Val Lys Met Leu Gin Thr Trp Gly Gly He ero s 
405 • 410 1 



816 



864 



912 



960 



1008 



1056 



1104 



1152 



1200 



1248 
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«B eta cca J.s gA ga g. £= att tg a. gt j*. -J tea tct 

Met Leu Pro Lys GlyXGlu Glu vax ue J-J-f * 
420 \ 425 

£ SS IS K S X - SS £ £ 5S S? SS - - Si 

435 \ 440 

k sss ss s ss A s? - s - i s ° ~ ?ss ~ 

450 \ 455 
aaa gac gcc att aac atg ^a tta teg ata tea cct gaa tgg etc caa 
Lys Asp Ala He Asn Met Tfcr Leu Ser He Ser Pro faiu irp ^ 
465 * 70 X 4 S 

s s ss ss ss ss ds? ss: is ss ss 5S - i - 

485 \ 490 

gag tta aga aaa aat gag eta dae cac aag cac tgg teg aat cca atg 
Glu Leu Arg Lys Asn Glu Leu &Ls His Lys his iiy » . 
500 505 

gaa gta cca ctt cca gaa get cc\ cac atg aaa ate tat tgt ata tac 
Glu Val Pro Leu Pro Glu Ala PrA His Met Lys He Tyr Cys He Tyr 
515 520^ 525 

ggg gtg aac aae eea act gaa agg |ca tat gta tat aag gaa gag gat 
Gly Val Asn Asn Pro Thr Glu Arg AUa Tyr vax ryx j-ot 
530 535 \ 540 

SS SS S2 SS 2! £S £SS - Sj S" SS 5! SE SS 

545 550 \ 555 



gta ttc etc acc gag ggg gac gga acc 
Val Phe Leu Thr Glu Gly Asp Gly Thr 

565 • 5 ' 



ccg etc gtg gcg cat tea 
Pro Leu Val Ala His Ser 
575 



1296 



1344 



1392 



1440 



1488 



1536 



1584 



1632 



1680 



1728 



.„ Wt cac aaa t ag gc cag |? t «=t t=a\ cc* tac aac cct jee g. 

Met Cys His Lys Trp Ala Gin Gly Ala ber\tru 
580 585 

ss ss ss ss in 52 sr. ss? s: ss s: K s ss ss- 

595 600 x bU!3 

ata cgt ggt gga gea aaa age gee gaa eae gtaW ate etc ggc age 
He Arg Gly Gly Ala Lys Ser Ala Glu His Val \Asp He Leu uxy :> 

610 * 20 
gcg gag ttg aac gat tac ate ttg aaa att gca a|e ggt aat ggc gat 
Ala Glu Leu Asn Asp Tyr He Leu Lys He Aia sex * y 
625 630 6 3 5 

etc gtc gag cea cgc caa ttg tct aat ttg age cag\tgg gtt tct cag 
leu val GlS Pro Arg Gin Leu Ser Asn Leu Ser Gln^Trp Val Ser Gin 
645 650 



1776 



1824 



1872 



1920 



1968 
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atg ccc ttc cca atg taa 
Met Pro Ph.e Pro Mat 
660 



1986 



<210> 5
<211> 661
<212> PRT
<213> Saccharomyces cerevisiae

Met Gly Thr Leu Phe Arg Arg Asn Val Gln Asn Gln Lys Ser Asp Ser
1 5 10 15

Asp Glu Asn Asn Lys Gly Gly Ser Val His Asn Lys Arg Glu Ser Arg
20 25 30

Asn His Ile His His Gln Gln Gly Leu Gly His Lys Arg Arg Arg Gly
35 40 45

Ile Ser Gly Ser Ala Lys Arg Asn Glu Arg Gly Lys Asp Phe Asp Arg
50 55 60

Lys Arg Asp Gly Asn Gly Arg Lys Arg Trp Arg Asp Ser Arg Arg Leu
65 70 75 80

Ile Phe Ile Leu Gly Ala Phe Leu Gly Val Leu Leu Pro Phe Ser Phe
85 90 95

Gly Ala Tyr His Val His Asn Ser Asp Ser Asp Leu Phe Asp Asn Phe
100 105 110

Val Asn Phe Asp Ser Leu Lys Val Tyr Leu Asp Asp Trp Lys Asp Val
115 120 125

Leu Pro Gln Gly Ile Ser Ser Phe Ile Asp Asp Ile Gln Ala Gly Asn
130 135 140

Tyr Ser Thr Ser Ser Leu Asp Asp Leu Ser Glu Asn Phe Ala Val Gly
145 150 155 160

Lys Gln Leu Leu Arg Asp Tyr Asn Ile Glu Ala Lys His Pro Val Val
165 170 175

Met Val Pro Gly Val Ile Ser Thr Gly Ile Glu Ser Trp Gly Val Ile
180 185 190

Gly Asp Asp Glu Cys Asp Ser Ser Ala His Phe Arg Lys Arg Leu Trp
195 200 205

Gly Ser Phe Tyr Met Leu Arg Thr Met Val Met Asp Lys Val Cys Trp
210 215 220

Leu Lys His Val Met Leu Asp Pro Glu Thr Gly Leu Asp Pro Pro Asn
225 230 235 240

Phe Thr Leu Arg Ala Ala Gln Gly Phe Glu Ser Thr Asp Tyr Phe Ile
245 250 255

Ala Gly Tyr Trp Ile Trp Asn Lys Val Phe Gln Asn Leu Gly Val Ile
260 265 270



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10 






Gly Tyr Glu Pro Asn Lys Met Thr Ser Ala Ala Tyr Asp Trp Arg Leu
275 280 285

Ala Tyr Leu Asp Leu Glu Arg Arg Asp Arg Tyr Phe Thr Lys Leu Lys
290 295 300

Glu Gln Ile Glu Leu Phe His Gln Leu Ser Gly Glu Lys Val Cys Leu
305 310 315 320

Ile Gly His Ser Met Gly Ser Gln Ile Ile Phe Tyr Phe Met Lys Trp
325 330 335

Val Glu Ala Glu Gly Pro Leu Tyr Gly Asn Gly Gly Arg Gly Trp Val
340 345 350

Asn Glu His Ile Asp Ser Phe Ile Asn Ala Ala Gly Thr Leu Leu Gly
355 360 365

Ala Pro Lys Ala Val Pro Ala Leu Ile Ser Gly Glu Met Lys Asp Thr
370 375 380

Ile Gln Leu Asn Thr Leu Ala Met Tyr Gly Leu Glu Lys Phe Phe Ser
385 390 395 400

Arg Ile Glu Arg Val Lys Met Leu Gln Thr Trp Gly Gly Ile Pro Ser
405 410 415

Met Leu Pro Lys Gly Glu Glu Val Ile Trp Gly Asp Met Lys Ser Ser
420 425 430

Ser Glu Asp Ala Leu Asn Asn Asn Thr Asp Thr Tyr Gly Asn Phe Ile
435 440 445

Arg Phe Glu Arg Asn Thr Ser Asp Ala Phe Asn Lys Asn Leu Thr Met
450 455 460

Lys Asp Ala Ile Asn Met Thr Leu Ser Ile Ser Pro Glu Trp Leu Gln
465 470 475 480

Arg Arg Val His Glu Gln Tyr Ser Phe Gly Tyr Ser Lys Asn Glu Glu
485 490 495

Glu Leu Arg Lys Asn Glu Leu His His Lys His Trp Ser Asn Pro Met
500 505 510

Glu Val Pro Leu Pro Glu Ala Pro His Met Lys Ile Tyr Cys Ile Tyr
515 520 525

Gly Val Asn Asn Pro Thr Glu Arg Ala Tyr Val Tyr Lys Glu Glu Asp
530 535 540

Asp Ser Ser Ala Leu Asn Leu Thr Ile Asp Tyr Glu Ser Lys Gln Pro
545 550 555 560

Val Phe Leu Thr Glu Gly Asp Gly Thr Val Pro Leu Val Ala His Ser
565 570 575

Met Cys His Lys Trp Ala Gln Gly Ala Ser Pro Tyr Asn Pro Ala Gly
580 585 590
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He Asn Val Thr He Val Glu Med Lys His Gin Pro Asp Arg Phe Asp 
595 600 605

Ile Arg Gly Gly Ala Lys Ser Ala Glu His Val Asp Ile Leu Gly Ser
610 615 620

Ala Glu Leu Asn Asp Tyr Ile Leu Lys Ile Ala Ser Gly Asn Gly Asp
625 630 635 640

Leu Val Glu Pro Arg Gln Leu Ser Asn Leu Ser Gln Trp Val Ser Gln
645 650 655

Met Pro Phe Pro Met
660



SEQUENCE LISTING 
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<110> Scymne Dr. , VSten 
<120> 



<130> 
<140>

<141>
<150>



<210> 1

<211> 1986
<212> genomic DNA
<213> Saccharomyces cerevisiae
<220>
<221> CDS
<222> (1)..(1983)

< ''°° > 1 .J^crtc cag aac caa aag agt gat tec 

atS 99C a« ecg ttt cga aga *KVj^ • £er Asp Scr 

Met Gly Thr Leu Phe Arg Arg AsnWl Gin Asn Gin .y ^ 

1 5 \ 10 

««t- ret adt cat aac aag cga gag age aga 
gftC gaa aac aat m «9 |£ c ^ Lya ^ Glu ser Arg 

Asn Glu Ann Asn Ly= Gly Gly -er v * 
20 



48 



96 



?go cat aag aga aga agg ggt 
jp.y His Lys Axg Arg Arg ,Gly 
AS 



aac cac att cat cat caa cag gga tta 
As-. Kis He Kis His Gin Gin Gly Leu 
35 40 

att age ggc agt gca aaa aga aat gag cg.ggc aaa jac «c £ agg 
lie Ser Gly Ser Ala Lys Arg Asn Glu Arg\,Gly Lys A-F ^ - 



gat tec aga aga ctg 
Asp Ser Arg Arg Leu 

80 



aaa aga gac ggg aac ggt aga aaa cgt egg a* 
Lya Ara Asn Gly Asn Gly Arg Lys Ar 5 Trp 

att tt= att =tt ggt gca ttc tta ggt gta ctAtg ccg ttt age ctt 
Tie Phe He Leu Gly A2.« Phe Leu Gly v«l Leu i-« 

as 20 x 

ggc get tat cat gtt cat aat age gat age gac tt* ttt gac aac ttt 
Gly Ala Tyr Kis Val Kis Asn Ser Asp Ser Asp ^ 
10 0 105 * 



14.4 



192 



240 



288 



33S 
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gta aac ttt gat tcfe etc aaa gtg tat ttg gac gat tgg aaa gat gtt 
Val Asa Phe Asp Sol Leu Lys Val Tyr Leu A=p Asp Trp Lye -Asp Val 
115 \ 120 125 



3S4 



cue cca caa ggt aca 
Leu. Pro Gin Gly lie 
130 



It teg ttt act gat gat- act cag get ggt aac 
>r Scr Phe lie Asp Asp He Gin Ala Gly .Asn 
135 



43 2 



cac tec aca tet ccc tta\gat gat ccc agt gaa aat ttt gec gtt ggt 

Tyr Scr Thr Ser Ser Leu W Leu Ser Glu Asn Phe Ala Val Gly 

150 \ 155 ISO 



14 



4S0 



aaa caa etc tta cgt gat ca\t aat acc gag gee aaa cac ccl yui yua 
Lys Gin Leu Leu Arg Asp Ty\ Asa He Glu Ala Lys His Pro Val Val 

170 17S 



165 



32 0 



atg get cct ggt gtc att tet 
Mec Val Pro Gly Val lie Ser 

ISO 



gga att gaa age tgg gga gtt att 
Gly He Glu Ser Trp Gly Val He 
nas 190 



57< 



ggcv gcic gat SJ^-S cgc 
Gly Asp Asp GlU Cys 
1£>5 



oat agi- t-r-.H geg cat ttt cgt aaa cqq ctg tgg 
Asp Ser Ser\Ala His Ph& Arg Lys Arg Leu Trp 
200 \ 205 



aga agt ttt tac atg etg aga aca atg gtt. atg sat aaa gtt tgt tgg 
Gly Ser Phe Tyr Met Leu Arg Thr MeV: Val Met Asp Lys Val Cys Trp 
210 215 \ 220 



624 



€72 



ctg aaa cat gta atg tta gac cct gaa \aca ggt etg gac cca ccg aac 
Leu Lys Kis Val Met Leu Asp Pro Glu Vhr Gly Leu Asp Pro Pro Asn 

235 240 



225 



230 



720 



ttt acg eta cgt gca gca cag gge ttc gaW tea act gat tat ttc acc 
Phe Thr Leu Arg Ala Ala Gin Gly Phe Gl^ Ser Thr Asp Tyr Phe He 
245 



250\ 



255 



7SQ 



aca egg tat tgg att tgg aac aaa gtt ttc W aat ctg gga gta att 
Ala Gly Tyr Trp He Trp Asa Lys Val Phe <*ln Asn Leu Gly Val He 
260 265 



270 



cgc tat gaa ccc aat aaa atg acg .agt get gete tat gat tgg agg ctt 
Gly Tyr Glu Pro Asn Lys Met Thr Ser Ala AlA Tyr Asp Trp Arg Leu 
275 280, \ 2BS 



BIS 



864 
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3 „ «. == * ... r - - »: - ;r u £ 

Ma Tyr Leu Asp Leu\clu Arg Arg A^p Arg iy 
290 \ Z9S 

Ma ate aaa ctg t\t cat caa ttg agt ggt gaa get tgt tta 

;r. si; s a «- - g - ^ vai cya r 0 

305 ^ 

A rt caa att ate ttt tae ttt acg aaa " tgg 

act ggn cat tec atg ggt Vet cag att a t 

He Gly His Ser Met Gly sW Gin lie lie Pne Ty 

3 2.S N 33 

.- -t-A t-.e oqt aat ggt ggt cgt ggc egg gtt 

i" s z z: iz B « y - - - **• ^ 

340 \ 345 



912 



9S0 



aat gca gca ggg acg ctt ctg ggc 
, sn Ala Ala Gly Thr Lou Lea Gly 
3 SS 

:t agt ggt gaa atg aaa. gat acc 
Le Ser Gly Glu Met Lys Asp Thr 
3 SO 



aac gaa^ cac ata gae tea ttc at 
A=n Glu Kis He Asp S&r Phe Il« 
355 3S< 

get cm aag gca get cca get eta 
Ala Pro Ly= Ala Val Pro Ala Leu 
370 375 

.A ___ tt g gaa aag ttc ttc cca 
att caa tta aat acg tea jee atg 4 ^ J ^ phe Scjr 

lie Gin Leu Asn Thr Leu Ala Met. Tyr\^J.y ^ 

385 390 

™ aaa atg tta caa aAg tgg ggt ggt ata cca tea 

aga att gag aga gca aaa atg t -J Jle pro Scr 

Arg lie Glu Arg Val Lys Met: Leu Gin Th* Trp 415 
405 ' - 1 N 



1008 



lOStS 



1104 



1152 



1200 



1248 



}99 



gat atg aag tea tct 
,y Asp Met Lys Ser Ser 
430 



atg eta cca aag gga gaa gag gtc att tgg 

Pet Leu Pro Ly S Gly' Glu Glu Val He Trp 

-120 425 
t „ gag gat gca ttg aat aac aac act gae aca\tac ggc aat ttc att 
Sar Glu Asp Ala Leu Asn A«i A.n Thr Asp ThrXTyr Y 

i. „<-t ttc aac aVa- aat fct9 aca aC9 
=ga ttt gaa agg aat acg age gat get ^ ^ ^ ^ 

A rg Phe Glu Arg Asn Thr Ser Asp Ala Ph. Asn * 

450 455 

v *=a tta teg ata cca cc\ gaa egg etc caa 
aaa gae gee att aac atg aca tea 3 p r o\.alU Trp Leu Gin 

Ly. Aso Ala He Asn Met Thr Leu ser He Ser ^ 

465 470 



1236 



13 44 



1392 



1440 
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\ - ttc qgc cac ccc aag aac gaa gaa 1483 

aga aga gta cac gag cagUac teg as Glu Glu 

* rg Arg Val His Glu oinWr ser Ph. Gly Tyr 
4SS 1 



At a cac cac aag cac tgg teg aat cca a. eg 

gag cca aga aaa aac gag ctea ^ A=n pro M£t 

Glu Leu Arg Lys Asn Glu LW KiS Ki- / ^ 

s o o 

\ ,- ee cac atg aaa acc tat tgt aCa cac 

ga a gta cca ctt cca gaa 9=V« Ile Tyr C ys He Tyr 

Glu val Pro Ltu Pro Glu AlaWo Hi- y g2S 
515 V° 

oca t« Sta tat aag gaa gag gat 

ggg gcg aac aac.cca act gaa. afeg g ^ ^ ^ q1u Rsp 
Gly val A=r. Asn Pro Thr Glu Aig Ala y 
530 335 



1536 



15 



1S3 2 



jce gac tac gaa age aag eati. ccc 
£l. A SP Tyr Glu ser Lys Gin Pro 

sss sso 



isao 



gac tec cct gee ctg aat ctg aej 
A-p ser Ser Ala Leu Asn Leu ~ u " 
S45 550 

r- D =a aVc gtt ccg etc gtg geg cat tea 
9" ttc etc ace gag ggg 9- gga * 9 ^ ^ Al& His Ser 

val Phe Leu Thr Glu Gly Asp Gly Tftr ^ 

sss x 

i- DCt \cca ccg tac aac cct gec gga 
atg tgc cac aaa egg S=c cag ggt g \ am '.Pro Ala Gly 

cys His Lys Trp Ala Gin Gly Ala \ser Ty 
580 385 \ 

i. _ aaa c3 ,c cao cca gat ega ctt gat 

att aae gtt act att gtg gaa atg aaa a ^ Asp 

lie Xcn val Thr Ila Val Glu Mat Lys Ki^ Ota 
5 5 5 600 

c-e\gta gac ate etc ggc age 

«. ««= 93 = 9.' HI h!s V.l M, II. «- «y «« 

lie Arg Gly Gly Ala X-y- ser ^ \ 

■ att ac>a age ggt aat ggc 9 at 
=eg gag ctg aac gat tac ate ttg * - ? A=n Qly Asp 

La Glu Leu Asn Asp Tyr He Leu Lys Xle Al^ 

w ^ *at ctg agc\cag tgg gtt tct cag 

etc gt= gag eca cgc c« ttg tct aat ctg g - ^ ^ ^ 

Le u Val Glu Pro Arg Ola Leu Ser Asn L-U ^ ^ 

645 



1728 



1824 



1S72 



1920 



196S 






198$ 



atg ccc ttc cca at< 
Mac Pro Phe Pro Met 
650 



itaa 
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<210> 2 1? 
<211> 661
<212> PRT

<213> Saccharomyces cerevisiae

Met Gly Thr Leu Phe 

Asp Glu Acn Asn Lys 
20 

Asn. His He Kis His 

35 ' 

He Ser Gly S<*r Ala 
SO 

Lya Arg Asp Gly Asn 
GS 

Xle Phe He Leu Gly 

85 

Gly Ala Tyr His Val 
100 

Val Asn Phc Asp Ser 
115 ' 

Leu Pro Gin Gly Xle 
130 

Tyr Ser Thr Ser Ser 
145 

Lys Gin Leu Leu Arg 

Met Val Pro Gly Val 
180 

Gly Asp Asp Glu cys 
195 

Gly Ser Phe Tyr Met 
210 

Leu Lys His Val Met 
225 

Phe Thr Leu Arg Ala 
245 

Ala Gly Tyr Trp lie 

2(30 

Gly Tyr Glu Pro Asn 
275 

Ala Tyr Leu Asp Leu 
290 

.Glu Gin Xle Glu Leu 
305 




Trp Asn Lys Val 
255 

Lys Mec Thr Ser 
280 

Glu' Arg Arg Asp 
235 

Phe His Gin Leu 
310 



Asp Pro Pro Asn 
240 

Asp Tyr Phe lie 
25S 

Leu Gly Val lie 
270 

Asp Trp Arg Leu 
2S5 

Thr Lys Leu Lys 

Lys Val Cys Leu 
320 
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,i. o,y His ... K.« L = « «n »• X.l. «- ^ «- 

„x ox. M . <"» A ° ly Z V " 

M » His u: w -y= «• »' - £ - Giy 

\ 360 

355 \i Tie ser Gly Glu Met Lys Asp Tar 

Ala Pro Ly= Ala Val Pro a\U Leu He Ser y ^ 

370 ! A M » t Tvr Gly Leu Glu Lys Phe Phe Ser 

He Gin Leu Asn Thr Leu Ala\Met Tyr Gly ^ 

z: ix. al u ^ v S1 z: kA» «» •» ii= n: •« 

_ ^ .„ a" -X- «V; °* Aap ' HCC «« ■« 

420 /CL **r, Thr Tvr Gly Asn Ile 

■ ser Glu A 3P Ala. Leu Asn Asa Asn (fe Asp Thr Tyr ^y 

435 V° \* Phe Asn Lys Asn Leu Thr Ket 

Xrg Phc Glu Arg Asn Thr Ser Asp A^a Phe As Y^ 

450 455 B Y ti- ser Pro Glu Trp Leu. Gin 

L,ys Asp Ala Ile Asn Met Thr Leu 5«V H« Ser 

46S 1X10 e onAslv Tyr Ser Lys Asn Glu Glu 

Arg Arg Val Kis Glu Gin Tyr Ser Phe >Gly Tyr ^ 

485 T «*«, vl- As His Trp ser Asn Pro Met 

Glu L=u Arg Lys A=n Glu Leu His Kx- L*- * q . 

505 \ 

500 \ t1q rr^.- rvs lie Tyr 

«,„ v.X Pro Leu Pro. Glu Al* Pro Hie MetUys ^ ^ 

Oly val A*n Asn Pro Thr Glu Arg Ala Tyr\v*l Tyr Lys Glu 

530 Thr lie Asp W Glu Ser Ly= Gin Pro 

Asp Ser Ser Ala Leu Asn Leu Thr lie Asp ^ 560 

545 550 ThT val PrC Leu Val Ala His ser 

Val Phe Leu Thr Glu Gly Asp Gly Thr Val Pr* ^ 

565 , ser ProVyr Asn Pro Ala Gly 

Met cys His Lys Trp Ala Gin Gly Ala Ser Pro y ^ 

580 t!,! Wis Gin pVo Asp Arg Phe Asp 

lie Asn val Thr He Val Glu Met Lys Hxs Gin .V _ 

595 rr- ir a i as* lie Leu Gly Ser 

Xle Arg Gly Gly Ala Ly= ser Axa Glu Kx. val As* I 

6 10 615 . Tlp aia SerXGly Asa Gly Asp 

Aia Glu Leu Asn Asp Tyr lie Leu Lye Xle Ala SerX 

"5 630 , " Se _ cm V" val Ser Gln 

Leu Val Glu Pro Arg Gin Leu Ser Asn Leu Se. Gin t . 

645 S5 ° 
Met Pro Phe Pro Met 
S60 
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<21C> 3ir 
<211> 2312 

<212> genomic DNA

<213> Schizosaccharomyces pombe

<400> 3

„occ=xcxx «Wxc*x 

„CXCCX»C -^™ C CCTTT5AGTC 

M c,*ccxxc M ccxcc=». — ™ 
TCT ,^ TITC ™*«x ««W xxgcscgcx* ™— 

Xt^CX TXTXXXXXCG CXGXxUg* GXXXXCG.CC 
. CTGCT^CGTT — CW ^ OTOT " 

g^c.xxk — *xxx -xx.U GXGXXX^= — ^ 

,«X„CGC,.C ~GCCXXCCC «X««U CGO*™ GXXC^GXXC 

• ^titcoVSL GTGACCAXCC TCTTAXTAXG 
GTCTTGATAT. GTACAATGAG GGATATCG^. GTG 

^ ^G^XXAgV AGTTGGTCGX XT AAT AATX G 
GTTCC-TGGTG-TTATCAGCTC AGGATTAGfl* A 

M GC.«GXX O.XGCXCGcW* XXX — " 
_.CCG — 

ccxxgcxgo >xxggxx*xg — ~ 

GCXXCXX.CO ,XXO«= = XX ^X.XGC, «W« 

»^ TGXTCATXGA GxAcAGCAAC AXXGXACATA 
AXATTTTTCA AAGXTAAAAA IGTTC AT x G A ^ 

rrG a XT CAC P» GGXXACGTAC 
„^ t, rTCTTGATT TCTCACTCCA XGGGTTCA<-n 
AGAAAAAGGT AG iCTTOAl J. \ ' 

. XATTTXXTTA AGTGGGTTGA WJCW^ ^ 

xx=™x g^x.xxg cxcg-ggxx 
gxxxg.cx*c gxxxcx^cx xxxg^g* x„=gW xxx.™ 

~r.r r-TGAAATGA AAGATACAGG 
, „ * ' r trTGGCAGC GCXTTTATCG GuTC-.W 
GCACCCAAAA. CAG iGGCAG ^ T ^ T \ G C TAACCGTTTT 

TATTGTAATT ACATTAAACA TGTTAATATX TAAXTTXTG 

^ GAAXCAGTTT XCGGXCTAXG GGXAAGclx ^XXGXXGA 
AAGCXCAATT G^XCAGTX ^„*L rTCCC= T T« 

G ,rnvmc x^txtactg tttnwttoo ^ J tttaata 

GAGGXAXAXX — 

gagagccatg axggxxcgca cxaxggcagg ^™ 

• .AGC-AGGCGA XGXXGXAXCG GGAAAXGCCA GX.CGGXAAY— ■ 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650
700
750

800

850
900
950

1000 

1050 

1100 

1150 

1200 

1300 
1350 
1400
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TGTTAATTTT TTATTAATAT TTAgIcTCCA GATGATCTTA ATCAAACAAA 
TTITTCCAAT GGTGCAATTA TTCGaVaTAG AGAAGACATT GATAAGGACC 
ACGATGAAYT TGACATAGAT GATGcA?TAC AATTTTTAAA AAATGTTACA 
GATGACGATT TTAAAGTCAT G CT AG CGAAA AATTATTCC C ACGGTCTTGC 
TTGGACTGAA AAAGAAGTGT TAAAAAaVa CGAAATGCCG TCTAAATGGA 
TAAATCCGCT AGAAGTAAGA ACATYAAAgV TACTAAATTA TACTAACCCA 
^TAGACTAG TCTTCCTTAT GCTCCTGAT^TGAAAATTTA TTGCGTTCAC 

cgggtcggaa aaccaactga gagaggttatWattatacta ataatcctga 
■. ggggcaacct gtcattgatt cctcggttaa Vgatggaaca aaagttgaaa 

ATGTGAGAGA AT T T ATGTTT CAAACATTCT aWaACTGTT TTATTAGGGT 
ATTGTTATGG ATGATGGTGA TGGAACTTTA COAATATTAG CCCTTGGTTT 

ggtgtgcaat aaagtttggc aaacaaaaag gt^taatcct gctaatacaa 

GTATCACAAA TTATGAAATC AAGCAXGAAC CTGaTGCGTT TGATCTGAGA 

„ >P , r rTrrAT ATAOSTGGAC ATT C AG AG CT 
GGAGGACCTC GCTCGGCAGA ACACGTCGA1 

AAATGTATGT T C ATT TT AC C TTACAAATTT CTATT. 
AAGGAAATTA TTTTAAAAGT TTCATCAGGC CATGGT 1 
CCGTTATATA TC AG AT AT CC AGTACGGACA TAAOTTT 
TAACTAACTA ACCGAACAGG GAAATAATAA AT GAG AT 
CCTAGAAATT AA ■ 



VCTA 



feACT 



:A CTCTTGAAAT 
CGGTACCAAA 
AGATTGCAAT 
TCTCGATAAA 



tGT 



1450
1500 
1550 

1600 
1650 
1700 
1750 

1800

1850 

1900 

1950 

2000 

2050 

2100 

2150 

2200 

2250 

2300

2312 
<210> 4
<211> 3685
<212> genomic DNA
<213> Arabidopsis thaliana
<400> 4



GAG^lAACCAT 
GCAAAAGAAA 
GAGGAGGGAA 
GTGTGTGTAA 
\CTGC GAGCTTCCCT 
CGCCCGGTGT 
GTCTTCATTC 
ACAATGCGCT 
GTGAAGTCTA 
GGGATTTGGA 
TCACTCAAAC 



TTGTTCAACAi 



CTAGTTCCTT 

TCAATTTCAA 

CATCCTCGTT 

TATATTAAAA 

TCTTCTTCTT 

AATGTTAXTG 

TCAATTATCT 

TXGACAATGA 



TAGTTTTTXT 
CTGTATGTGT 
CTGTTTGATT 
GTCATCTTAT 
AACAAGTTAC 
CGTATAATGA 
CTACATCTGT 
TTATTCAACT 
CAGCTCAATG 
CGAGTTTATT 
GCTTTGAACT 
TTGTGGGTTG 
GTTCATGTTT 
CAGTGGTATA 
GGCCTCTATG 
GATCCAGCTG 



(TTTXTTTGAT 



.aggtaagcg 
:aacttta 

TcJfcCAAATCT 
TTCCTTTACT 
AGT^pTGTTT 
GGAA?ipTGTG 
ATGTA^TGT 
TGCTTT3CTC 
CACATGT^TT 
:c 

CATCTTTGT^ 

xtgctatgg"] 
gttgatagta^ 
ttgggtggaa 
gtattagagt 



50 
100 
150
200
250
300
350
400
450
500
550
600
650
700
750
800
850
900
950
1000 
1050 
1100 
1150 
1200 
1250 
1300 
1350 



xcag=*cx=g xggcxgcxL cxacxxxgcx ccxgcgtacx xxgxgxgggc 

AGXGGXGAXX CCtUCCTAcmKTWO AXAXGAAGAG AWAXAXGX 

«„«™ .t^Wct txcagaacac agaggxxgxx 

TCXTXCXATX W« «OTT«««T TCTTTCTTCA 
XX>GXTAAGG «»»■»» XtWgTXG AAXXAAXAGG XACGXGAXCA 
GACXCXXAGG CGXAXGAAAA GTAAtAXAGA GXXGAXGGXX XCTACCAACG 

gtcgaaaaaa agcagtxaxa cxtcgWx ccaxgggggx cxxgtaxxxx 

CTACATTTTA XGAAGXGGGX XGAGGcW GGXCGXCXGG GXGGCGGGGG 
. XCGGCGAGAT XGGXGXGCAA W^W GGGGGTGAXG AACAXXGGXG 
GACCAXTTGT XGGTGXTCCA AX*GCXg|& CAGGGCTTTT CTCXGGXGA, 
GCAAAGGAXG .TXGCAGXTGC CAGGXATXGA AXAXCXGGXX AXACXXXXGA 
XGAXGACAAG CTTCGCTCTG GAAGXGAaA XXATXCXACX AAAXAXCAAX 
XGXAAXAACA XXGGXAXAXX XT-C^W*™ XXGAXXAXXX 

i\ rTCTCTTOA k-TTAC-ACAA ATGATGAAXT 
TTGCTGCTTA TOTAACTGAA ACTCTCTTGA ^ 

GAX.AXXCTX ACGCAXXGCT CXGXGAXGAG CAGXXTGXXA GCXXCGACGA 
XAACATXXGX GAXACXGXCX XXXGGA==GC AxWxXXX GCXAXGGAAA 
GCGCCGAGG XXCCAXGGXX GGAXXGXXXA GCaWxAGGG XXAXXCXGGX 
XCXXXCAAXT XXCXTGXAXA XGCAXCXAXG GXCxWxX XCXXCXXAAT 
XAAAGAGXCG XXGGAXXAGX XGGXCTAXTA GXCAG^GGX XGCXXAATAX 
■ AGAACXXTAC XXXCXXGGAA AAXXGCAGAG CGATxW X«XTT«TX 
GACACCGAXA XATXTAGACT XCAGAGGXXG CAGCAT-W 
ACGCACAXGG =ACXGAA=AA XGXCXAXGXX ACCGAAGGGA GCXGAGACGA 
XAXGGGGCGG GGXXGATXGG XCACCGGAGA AAGGCCAG^ CXGXXGXGGG 
AAAXAGCAXA AGAACAACGA AACXXGXGGX GAAGCAGGxi AAAACGGAGX 
XXCCXAGAAA AGXGCXGXXA ACXAXGGAAG GAXGAXAXCX = 
AAGXAGCAGA CGCXGGGCCA XCXGAGAXTA AXAAXATXGA VxXCGAGXA 
XCGACAXAXA AAXCAXAATA AACCXXGTAC AXTXXGXGAX X*XATGATGA 
AX ? XGXGTA= AXXXXAXGXG GTGAAGGSXG , CXGXGAAAGG XcWgXAXC 
CCXXKCXCX CCXGXCGTGA CGXGXGGACA GAGXACGAXG AC^GGGAAX 
- XGCXGGGAXC AAAGGTAXCG GXGAG T XXAA GGXCXACACX CCX^ 



1400

1450

1500

1550

1600

1650

1700

1750

1800

1850

1900

1950

2000

2050

2100

2150

2200

2250

2300

2350

2400

2450

2500

2550

2600

2650

2700

2750

2800

2850




2900 
2950 
3000 
3050
3100
3150
3200
3250
3300
3350
3400
3450
3500
3550
3600
3650
3685



<210> 5
<211> 402
<212> cDNA
<213> Arabidopsis thaliana
<220>
<221> CDS
<222> (120)..(402)
<400> 5

MWACMacrnaicicT CTCGACTGA^CTAAGAATCC CTAATCTGTGTTCTAAATTC 
CTGGACGAGATTTGACAAAG TCCGTATAGC^AACCTGGT TTAATTTCAAGTGACAGAT 



SO 
11S 



ATG CCC CTT ATT CAT CGG AAA- AAC 
Met Pro Leu Ile His Arg Lys Lys



ACG CAG AAA CCA TCG ACG CCG IS 7 
era Thr Glu Lys Pro Ser Thr Pro 



_ _ ft w rlT Trn CA a AAA CCA CAC 215 

S£ iS Si SIS 51! SS MS if, S£ ^« ^ - «. 

xc, xec ^ tcc eje «. J» -V- - - =f y £ £ 

Glu Ser Ser Lys Ser his His i->ys rrr * 

TCG TGC ATC GAT TCT TGT TGT TGG TTC ATT GGG TGT GTG TGT GTA ACC 311
Ser Cys Ile Asp Ser Cys Cys Trp Phe Ile Gly Cys Val Cys Val Thr

_ _ ^-k RTG rCT GCG AGC TTC CCT 

TGG TGG TTT CTT CTC TTC CTT TAC AAC GCK ATG CCT C ^ 
Tro Trp Ph= Leu Leu Phe Leu Tyr Asn Ali Mec Pro ax 

CAG TA'r GTA ACG GAG CCG AAT CAC G»G TcAtTT CCC TTA CCC G 

.Gin Tyr Vnl Thr Glu Pro Asn Has ??? Serine 



359



402 






<210> 6
<211> 643
<212> cDNA
<213> Zea mays
<220>
<221> CDS
<222> (1)..(402)
<400> 6

CGG GAG AAA ATA GCT GC^
Arg Glu Lys Ile Ala Ala
1 5

GAT GAA ACT GTT CCA GTT CTT
Asp Glu Thr Val Pro Val Leu
20

TGG CGT GGC AAA ACT CCT
Trp Arg Gly Lys Thr Arg
35

AGA GAA TAC AGC CAT TCG CC
Arg Glu Tyr Ser His Ser Pro
50 55



GGG GGT GTT TAC TTA GCC GAT GGT 

Gly Gly Val Tyr Leu Ala Asp Gly 
10 15 

GCG GGC TAC ATG TGT GCG AAA GGA 

Ala Gly Tyr Met Cys Ala Lys Gly
25 30 

CCT GCC GGC AGC AAG ACT TAC GTG 

Pro Ala Gly Ser Lys Thr Tyr Val 

45 

TCT ACT CTC CTG GAA GGC AGG GGC 

Ser Thr Leu Leu Glu Gly Arg Gly 
60 



ACC CAG AGC GGT GCA CAT GTT GAT ATA ATG GGG AAC TTT GCT CTA ATT
Thr Gln Ser Gly Ala His Val Asp Ile Met Gly Asn Phe Ala Leu Ile
65 70 75 80



48



96



144



192



240



288



GAG GAC GTC ATC AGA ATA GCT GCT GGG GCA ACC GGT GAG GAA ATT GGT
Glu Asp Val Ile Arg Ile Ala Ala Gly Ala Thr Gly Glu Glu Ile Gly
85 90 95

GGC GAT CAG GTT TAT TCA GAT ATA TTC AAG TGG TCA GAG AAA ATC AAA 336
Gly Asp Gln Val Tyr Ser Asp Ile Phe Lys Trp Ser Glu Lys Ile Lys
100 105 110

TTG AAA TTG TAA CCTATGGGAA GTTAAAGAAG TGCCGACCCG TTTATTGCGTTCC 391
Leu Lys Leu
115

AAAGTGTCCT GCCTGAGTGC AACTCTGGAT TTiGCTTAAA TATTGTAATT TTTCACGC 449

TTCATTCGTC CCTTTGTCAA ATTTACATTT GACAGGACGC CAATGCGATA CGATGTTG 507

TACCGCTATT TTCAGCATTG TATATTAAAC TGTACAGGTG TAAGTTGCAT TTGCCAGC 565

TGAAATTGTG TAGTCGTTTT CTTTACGATT TAATANCAAG TGGCGGAGCA GTGCCCCA 623

AGCNAAAAAA AAAAAAAAAA 643
<210> 7
<211> 115
<212> PRT
<213> Zea mays
<400> 7



Arg Glu Lys Ile Ala Ala Leu Lys Gly Gly Val Tyr Leu Ala Asp Gly
1               5                   10                  15

Asp Glu Thr Val Pro Val Leu Ser Ala Gly Tyr Met Cys Ala Lys Gly
            20                  25                  30

Trp Arg Gly Lys Thr Arg Phe Ser Pro Ala Gly Ser Lys Thr Tyr Val
        35                  40                  45

Arg Glu Tyr Ser His Ser Pro Pro Ser Thr Leu Leu Glu Gly Arg Gly
    50                  55                  60

Thr Gln Ser Gly Ala His Val Asp Ile Met Gly Asn Phe Ala Leu Ile
65                  70                  75                  80

Glu Asp Val Ile Arg Ile Ala Ala Gly Ala Thr Gly Glu Glu Ile Gly
                85                  90                  95

Gly Asp Gln Val Tyr Ser Asp Ile Phe Lys Trp Ser Glu Lys Ile Lys
            100                 105                 110

Leu Lys Leu
        115
<210> 8
<211> 616
<212> cDNA
<213> Neurospora crassa
<400> 8



ggtggcgaag 
agatggatct 
gthtacngtt 
tggcgcccga 
actttgactn 
tggcacagtg 
aaatgaagag

CCGCATGAAC
CTTAAATATG 
ACATAGGTTA 
&JSAAAAAAAA 



TGACNCTCGG 50

cagcctcaag 100

TACATCTATC 150

GATCGATACG 200

GCGAAGGCGA 250

AAGGGGTGGA 300

GGTCGAGATG 350

ATACGGCGGA 500

TAAATACGGC 550

TTTTTTCTAA 600

616
<210> 9
<211>
<212> genomic DNA
<213> Arabidopsis thaliana
<400> 9






ATGAAAAAAA 
GGTGACGATG 
TGATTCTGGT 
AGAGAATACA 
TCATAAGAAG 
TATTGTCTCC 
GACCfTni.TT 
TCCTCATTTC 
GGTTAGTACT 
AAATAGACAT 
GCTAGTCGGT 
GTGATTTTCC 
CTATGTCATG 
GGTTTGATTT 
T CTAG AGAAA 
CATATGATTT 
GCCTCACAGT 
CGAGAACGAA 
TTTTCGTCCT 
TACATCAAAC 
TCAGATGAAG 
ACCCTTTGCT 
CT ACT T C CAT 
AACTCCCCAG 
AC ATT GG AT T 




TATCTTCACA TTATTCGGTA 
ACCTCGATGT CBTCAAGCTGT 
T CC AGGAAAC GqAGGTAACC 
AG C C AAG TAG TGT^CTGGTGT 
AGTGGTGGAT GGT 
CTTCACCAGG TGCTT^CAGCG 
TGGATGATTA CCAAAATGCT 
GGTTCGACCA AATCACT 
TTCCAAGATA ( TATCA' 
AAATTTGGGG GATTATTGTT 
AATGTGAGTG TTATGTTAGT 
ATTTTAAATG AAG CTAG AAA 
AGAATTATAA GG AC ACT AT G 
G C AG AG AT G C CACATCTTAC 
AAATGCGGGT ATGTTAACGA 
CAGGTACGGC CTGGCTGCTT 
TCCTACAAGA CCTCAAACAA 
G GAAAG C C AG TGATACTCCT 
CCATTTCCTC AACCGTACCA 
ACTTTGTTGC ACTCGCTGCG 
ACATTTGCTT CTGGCAACAC 
GGTCAGACGG CAT CAG AGG A 
CTACCAAAGT GTTTCACGAC 
GTTAACTACA CAGCTTACGA 
CTCACAAGGA GTTGTGCCTT 



GT CAT AG CG A 
GGGTAGCAAC 
AG CTAG AGGT 
AGCAGCTGGT 
ATGGTTCGAT 
ATCGAATGAT 
CCTGGTGTCC 
ATACCTCGAC 
GGACATTTGC 
ATATCAATAT 
lT AGTT AAT G 

g\tgtcgttt 
taj^tgtagc 

ATGGJ^ACATT 
CCAAAO 
CGGGCC?\CCC 
iAA 

CTCCCATAGXj 
CCCCTTCATG^ 
CCATGGGGTG 
ACT CGGTGT C 
C C T C C GAG AG 
AGAACTAAAC 
GATGGATCGG 
ACAAGACAAG 



TTGGTGGflj 



50 

100 

150 

200 

250

300 

350 

400 

450 

500 

550 

600 

650 

700 

750 

800 

850 

900 

950

1000 

1050 

1100 

1150 

1200 

1250 
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TTAACAGAGG AGCTGATGAQ TCGGGGAGTG CCAGTCACTT GCATATATGG 

GAGAGGAGTT GATACAGCGG WsGTTTTGAT GTATGGAAAA GGAGGATTCG

ATAAGCAACC AGAGATXAAG pA^GGAGATG GAGATGGGAC GGTTAATTTG

GCGAGCTTAG CAGCTTTGAA AGQTCGATAGC TTGAACACCG TAGAGATTGA

TGGAGTTTCG GATACATCTA TAQTTAAAGA CGAGATCGCA CTTAAAGAGA

TTATGAAGCA GATTTCAATT ATTAATTATG AATTAGCCAA TGTTAATGCC
GTCAATGAAT GA
<210> 10
<211> 3396
<212> genomic DNA
<213> Arabidopsis thaliana
<400> 10






ATGGGAGCGA 
TTTTTTCTTG 
TTCACGGCGA 
TCGACGCAGC 
CGACTTCAAT 
ATCTTCATTT 
AATTCCAAGC 
TCGTTCATTA 
AAAACAGCTO 
TAGCGCAATG 
CAAGTTTTTC 
CTTGTGCATT 
TGGTTTACAG 
TAGATCCTTA 
AGTGGTCTTT 
TTTCGGATTT 
TGTGATATAA 
TTTCTACTGT 
GAAGCAAATG 
CAAATTGGAA 
VCAGGCTAAT 
TCTGGTCGTC 
CCGTGGCGGC 
TC AG ATACTT 
TTGAAGTGGC 
CCTACTATCC 
TGTTGTGACT 
GAGCTCCTCT 
GTAACGTTTG 
TTTAAHTAHT 
TGAAAGTATT 
TAGATCTTGA 
GTGAGCGGCC 
TTCAGATTAT 
CTTTTTATTT 
TGACATGCGC 
GGTTGTTGTC 
AAGAATTGCA 
TGCAAAGAAA 
CAAAATATTC" 
ACTAGCGGTT 
GGAAGAATGT 
TACAGAAACA 
CCCTTTTGTC 
GCAATAGAAG 
GAAGTACGTA 
TTGCTGGCTT 
TGTTCATATG 
CCTTATTATT 
GTTTACAGTT 
ATTCTCTAGT 
TCTTTGTQAA 
TGATGACCCT 
AAAATGTATT 
ATTCTCAATA 



ATTCGAAA' 
ATTTGCGGT 
CTAGTCGAAG 1 
TACGAGCGTG 
CCGCYCGACC 
CCTTCGCTCC 
GAAATATAGC 
GTCAACAGTG 
ACTCGGCGAG 
AATGTGTAAT 
AG AGTGCTC A 
GTGATTCTTT 
CTTCTTTCTG 
TAATCAAACA 
CAGCCATCAC 
TTCTTTCTTT 
TATGGCTAAG 
C TG G AAAG AG 
CAATTGTCGC 
OAGCGTGACC 
GTCTTTTATC. 
TTCCTTTTTG 
CCTTCTATAG 
TCTGGAATGG 
T T G AT C AG C A 
TTAAGTTACC 
TACTGGATTG 
TCTTGGTTCT 
GCCTTCCTGT 
TGATATCA^.C 
ACTTTTGTTA 
AGTGCTAGTT 
TTAGCTAATA 
TATGGTAGAC 
TAATAGGCTA 
TTCTCATGTT 
CAATTCTTTT 
AGGGTGATAA 
GATAAGCGCG 
TGGCTGGCCG 
AGACTCTGTA 
TCACTCTCAT 
GCTCTAGTCA 
TTTCACAGCC 
ACTATGACCC 
CCTTTCTTTG 
CTTGTACGTC. 
CTTTGTCTTT 
GATTATCAGT 
ATGAATGCAA 
TTGTTTTGAC 
TTA.TATATAA 
GTTTTTAATC 
TTGCATATAT 
TCACATTATG 



AGT AACGG CT 
GCCGAACTGC 

;tcgatcctt 
cgtatggct 
'attctgtc 

A^TGAAGCAT 
AQGCTTCTGA 
TCTTTCCC'AT 
TAGTCTGCGC 
ATAGTAGTTA 
TGGTTGTTGC 
CTGTCAACTG 
GACCkTCCCG 



"GG AT 



rcT 

VTT 

;t 
\c 



AG AA' 
TGAGT 
TTCAT 
TGGCT 
.TGTTC 
TTTACTTT 
TTCTCTTT 
CAGGTTGAC 
TATTTGCCCA > 
C TG AG G C T AG 
TATCCATGCT 
ATTTTATTTT 
AGCTCGATAC 
GTTGAGGCAA 
TTCTGAGGTG 
CAGGTCTTAT 
ATTGAACTGC 
ATCAAAGAAC 
CAACCAAACC 
TTTAAGTTGA 
TGATTTGTTT 
TTTTGTTGGC 
GCGTCGTCAT 
CACATTCTGG 
TATACCACTG 
ACAAATATTA 
TATGCAACTG 
ATTTCGTTCC 
ACATGACCAG 
CGTGAACTAG 
AGATAGCAAG 
TGATAAGAAA 
. AAATTGTTTT 
CTTACTATAA 
TCTCTCCTTA 
A-AG G GGG TAT 
TAATAGCGTC 
CATGCTAACT 
CTCTGACTCC 
GGTGCTCATC 
CGTTGACTTT 



TCCTTCACCG 
GGTGGAGGAT 
TAJXTCATSCC 
GACTGTCCAT 
AGACACCACT 
GGTCGAGTCA 
GTCTCGTCTC 
ATCTGAGTTT 
CGCTTTTGGT 
TTTTTATTCA 
GAAAATGTTA 
TTACTGATCG 
CTGGTTTAAG 
AGTGTAAGTC 
CCAGGTTACA 
TCAATTTGAT 
TGGTCAATTT 
GGTGTGTTGA 
G AT TG G AG AT 
CAAGCTCAAG 
ATGTAAGATA 
TTTGAAACTG 
TTCAATGGGT 
ATTGCACC 
'ATTTCGCTG 
'CTCTAATT 
cfaSATTTGTT 
TCAAATCTAC 
ACOTCTGACT 
AACTCACTGG 
TGTACGCGAT 
ATAT^GTGGG 
•ACAC 
CAAG ATACTT 
ATTGAAJ^TCA 
AAGGCT 
TGTGGCrtAT 
ACGCATTTTT 
TGATGAAGAG 
TTAACATTG 
TAACACTAA 
TTTGATGTGT 1 
CATGGAATGT 
CAGATGGGAC 
AG G ATGTT AC 
TATTGCTCAT 
GTTTAAATCT 
GAAACAAGTA 
TATTATGGAA 
TTTAGTTGAT 
AATTTTGTTT 
AT AC TTTTCA 
TTGGGAGAGA 
TAAAGACAGA 
G TT ATT AT AT 



TCATCGCCGT 50

GAGACCGAGT 100

CCC.N?T?CG5 150

ACACTCCGTT 200

AAGGTCCGTG 250

CTTGTTGATG 300

TCTTATTGAT 350

AGAGTCATAT 400

TCGCTAAATG 450

ACTAGATCTG 500

GGTCATTTTA 550

ACGTGATGGA 600

TGTATGGTGC 650

ACGGCCTGAC 700

TAACAGGTAG 750

ATCATCTTGT 800

TCAGGTCCTC 850

GTTTGGTATA 900

TGTCACCAAC 950

TTAGTCCTTA 1000

AGCTAAGAGC 1050

CTTTAAAACT 1100

AATAATGTCT 1150

AAAACATTAT 1200

TTGGTACCGG 1250

GGGGGAGTTA 1300

GTTGATTTAG 1350

TCTCTCTGGT 1400

TCTCTTTAGT 1450

ATTTTCCTTT 1500

ATGGTATCTG 1550

TAGTATACCT 1600

TGATTTAGTT 1650

TGACTGAAAT 1700

TGTGACATAT 1750

GGAACTGCTC 1800

GCCATTTTCA 1850

CTGGGGGTGC 1900

GAATATCAAT 1950

AATTCCTTCC 2000

AAAAGTTTCA 2050

ATCCATCAGT 2100

GGCCTTCCCA 2150

•CTTTTCAAA 2200

.CCAGTTAAA 2250

r AT CATC AC 2300

cVaTA.TCAAT 2350

TAATCAGAAA 2400

TGTCTTTTTC 2450

TGATTCTCTC 2500

TTQTAGCAAA 2550

GGTTGTATCA 2600

CCA&CTATAA 2650

GGTATGATGC 2700

TCCCCA.TTTG 2750



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GTTTGCAATA 
CCTATTAAGC 
GGTTATTACT 
CACGGATATC 
CCGCXATGGC 
GTGGCATGTT 
AAGTACTTTT 
AAGTGGGAAG 
ACCAAAACAA 
CTGCTTTTAA 
GATTGTGCAA 
CCTATAACTG 
CTTCTTGCAA 
CTTGCTATGT 
TGATTATGAA 
GCAAGAATTG 
CTCTTTTTTA 
' ' CTGGTTJVTGT 
' TCTCTGTACT 
GAAAATAAAA 
• TAAATGTTG A 
GCACCAAGGG 
GGGGJSAGAGA 



TCTTTTTGAA 
GTTAAAGGTA 
TTGCCCCAAG 
ATTT AT G AAA 
ACAAGTAAAA 
ATCTCAGTTG 
TTATCATTCC . 
AGGTGTTGCA 
AACTAACCCA 
AAAATTTGTT 

tatctgcagg 
gggatgagac 
actactgaag 

TCTCTAGTAC 
ATTGATCTCT 
GCTCGGACCT 
GTTCCTCACC 
GTTGATTTAC 
CCTCAAGAAC 
CAACAGCCAG 
TCATGAGCAT 
TTAAGTACAT 
ACCGCAGTCT 



TTA^GATTTA 
CTAAJ&TGTAT 

TGGCA^ CCT 
CTGAAGGTTC 

CAGGAAGGCA 

CATAAGCAAA 

. TTTTGAGCTT 

TGAAAC^PA 

TTTCTGjUpT 

TTAAGWP^GC 
TCTGGAAOTG 

GGTAAGCT 
ACTAAGAT 
ACTGCAATA 
TATAGGTAC 
AAAGTTAACA 
TT AT AT AG AT 
CTCCAATTTG 
TTGTATTAAT 
AACACGATGG 
GGGTCAGACA 
AACCTTTTAT 
GGGAGCTTGA 



TCTTCTCCCT 
GAAGCTGTCT 
TATCCTGATA 
CCTCGTGTCA 
AAGTCTTCTG 
TTATTAAACA 
AGTGOATGAT 
CACTTGTATC 
TCATATTATT 
GAAAAACTAO 
TGGTTGATGG 
GAAGTTGGTT 
TACTTGCTTC 
TGACTCTCCG 
CTATCATTCA 
TAACAATGGC 
lCAAACTTTAA 

:tctttctaa 

tTAAACGAGA 
.GCGACGTA 
?&ATAGCTAA 
G^kGACTCTG 

taAaagtggg 



TGCATCTTAT 

gtcataggtt 
attggatcat 
aggtaatttt 
tatcagtcta 
ac t aaaattt 
cagtggctta 
aaagataact 
aggagtagtc 
ttcatatctt 

GAACGCTGGA 
TTGAAATTAT 
TGGAACACTG 
CTACTTTTAT 
CTCTCTTGGT 
TCCCCAGGTA 
GTGTACTTTT 
AAATC AT ATA 
TTCTCATTGG 
CATGTGGAAC 
CATGACAAAA 
AG AG CAT T C C 
TATTAA 



2800

2850

2900

2950 

3000 

3050 

3100 

3150 

3200 

3250

3300 

3350 

3400 

3450 

3500 

3550 

3600 

3650 

3700 

3750 

3800

3850 

389S 






<210> 11
<211> 709
<212> cDNA
<213> tomato
<400> 11




AGCATGATGT 50

GTGAAGATAT 100

ACCTATTATG 150

GGAGCTTGAT 200

TGCGGGAGOT 250

TCCAAGTTTG 300

AAATGTTTGA 350

CGTGGGCCTC 400

ATGTGCATGG 450

TTTTGCAAAA 500

CAACGATGCA 550

ATTGAAGAGT 600

GAAACTTTGT 650

AGCGAAATGA 700

709



